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FOREWORD 


Among  the  responsibilities  assigned  to  the  Office  of  the  Manager, 
National  Communications  System,  is  the  management  of  the  Federal 
Telecommunication  Standards  Program.  Under  this  program,  the  NCS,  with  the 
assistance  of  the  Federal  Telecommunication  Standards  Committee  identifies, 
develops,  and  coordinates  proposed  Federal  Standards  which  either  contribute 
to  the  interoperability  of  functionally  similar  Federal  telecommunication 
systems  or  to  the  achievement  of  a  compatible  and  efficient  interface  between 
computer  and  telecommunication  systems.  In  developing  and  coordinating  these 
standards,  a  considerable  amount  of  effort  is  expended  in  initiating  and 
pursuing  joint  standards  development  efforts  with  appropriate  technical 
committees  of  the  International  Organization  for  Standardization,  and  the 
International  Telegraph  and  Telephone  Consultative  Committee  of  the 
International  Telecommunication  Union.  This  Technical  Information  Bulletin 
presents  an  overview  of  an  effort  which  is  contributing  to  the  development  of 
compatible  Federal,  national,  and  international  standards  in  the  area  of  High 
Definition  Television  as  related  to  telecommunications.  It  has  been  prepared 
to  inform  interested  Federal  activities  of  the  progress  of  these  efforts.  Any 
comments,  inputs  or  statements  of  requirements  which  could  assist  in  the 
advancement  of  this  work  are  welcome  and  should  be  addressed  to: 
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1.0  OVERVIEW 


This  document  summarizes  work  performed  by  Delta  Information  Systems, 
Inc.  (Delta)  for  the  National  Communications  System  (NCS),  Office  of  Technology 
and  Standards.  The  NCS  is  responsible  for  the  management  of  the  Federal 
Telecommunications  Standards  Program,  which  develops  telecommunications 
standards,  whose  use  is  mandatory  for  all  Federal  departments  and  agencies. 

This  document  is  a  final  report  for  a  Task  Order  on  Contract  DCA1 00-91 -C- 
0031 .  The  titles  for  the  contract  and  Task  Order  are  listed  below. 

•  Contract  DCA1 00-91 -C-0031 

Development  of  Federal  Telecommunication  Standards  Relating  to 
Digital  Facsimile  and  Video  Teleconferencing 

•  Task  2/Subtask  No.  6  (1992) 

High  Definition  Television 

In  recent  years,  there  has  been  considerable  activity  in  the  development  of 
technology  and  standards  related  to  High  Definition  Television  (HDTV).  The  broad 
purpose  of  this  task  is  to  analyze  the  HDTV  standardization  process  from  the 
perspective  of  the  telecommunications  interests  of  the  Federal  Government.  To 
accomplish  this  objective,  the  work  on  the  project  was  divided  into  the  four  tasks 
listed  below. 

1 .  Analyze  activities  which  are  contributing  directly  to  HDTV  standards, 

2.  Analyze  activities  which  are  contributing  to  telecommunication 
standards  which  are  related  to  HDTV, 

3.  Analyze  potential  applications  of  HDTV  in  the  Federal  Government, 

4.  Identify  and  recommend  actions  which  (1)  maximize  the 
interoperability  between  HDTV  systems  and  government 
telecommunications,  and  (2)  maximize  the  applicability  of  HDTV 
within  the  government  community. 

This  document  summarizes  the  work  performed  on  each  of  these  four  tasks 
and  an  overview  on  each  task  is  provided  below. 
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Status  of  HDTV  Standards 


•  The  FCC  and  ACATS  organizations  are  evaluating  five  proposed  HDTV 
systems,  four  of  which  are  all-digital.  It  is  a  virtual  certainty  that  one 
of  the  four  all-digital  systems  (or  a  hybrid  combination)  will  be 
selected  to  be  the  winner  in  1993. 

•  On  the  international  front,  the  Japanese  are  providing  the  MUSE 
system  (1125  lines;  60  fields/sec.)  on  an  operational  tariffed  basis, 
while  the  Europeans  are  experimenting  with  the  MAC  system  (1250 
lines;  50  fields/sec.).  While  these  systems  are  both  fundamentally 
analog  in  nature,  there  is  considerable  concern  in  these  foreign 
countries  that  the  U.S.  all-digital  standard  will  supersede  their 
systems.  For  this  reason,  they  are  supporting  work  by  the  CCIR  to 
develop  an  international  all-digital  HDTV  standard. 

Status  of  Telecommunication  Standards  Related  to  HDTV 

•  The  ISO  organization  is  developing  the  international  MPEG-2  standard 
which  is  a  toolkit  having  several  important  application  profiles  -  (1) 
HDTV,  (2)  H.26x  (video  codec  for  ATM/BISDN),  (3)  basic  NTSC 
entertainment  TV  (contribution,  distribution,  VCR),  and  (4) 
cable/satellite  systems.  The  specification  for  the  MPEG-2  standard  is 
scheduled  to  be  frozen  in  March,  1993.  Significant  effort  is  being 
made  to  align  the  FCC  HDTV  standard  with  MPEG-2  to  maximize 
interoperability. 

•  The  CCITT  ATM  Video  Coding  Experts  Group  is  developing 
Recommendation  H.26x  to  be  compatible  with  MPEG-2.  This  will 
insure  the  ability  of  an  extension  of  H.26x  to  efficiently  transmit  the 
HDTV  signal  over  the  future  ATM/BISDN  network. 

HDTV  Applications  in  the  Federal  Government 

It  Is  generally  acknowledged  that  the  use  of  television  throughout  the 
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Federal  government  is  pervasive  and  important  to  its  successful  operation.  When 
the  domestic  TV  standard  was  upgraded  from  monochrome  to  color,  there  was  a 
major  breakthrough  for  the  Federal  government.  It  is  anticipated  that  the 
conversion  from  NTSC  to  HDTV  will  be  at  least  as  revolutionary.  The  higher 
resolution  will  provide  a  far  more  effective  link  between  Federal  employees  and  the 
vast  quantities  of  information  which  is  generated  throughout  the  Federal 
government  on  an  ever-expanding  scale.  Key  application  areas  include  multimedia 
desktop  workstations,  training/distance  learning,  simulation,  command/control, 
reconnaissance/surveillance,  and  video  teleconferencing. 

Recommendations 

Based  upon  the  work  performed  on  this  project,  the  following 
recommendations  are  proposed. 

•  To  maximize  interoperability  between  future  HDTV  systems  and 
Federal  telecommunication  systems  (e.g.,  video  teleconferencing),  it  is 
recommended  that  the  domestic  HDTV  standardization  process  and 
the  international  MPEG-2  standardization  efforts  be  aligned  as  closely 
as  possible.  To  accomplish  this  alignment,  it  is  recommended  that 
Delta  continue  its  active  participation  in  both  of  these  organizations. 
To  insure  that  the  H.26x  Broadband  ISDN  video  coding  standard  is 
closely  aligned  with  these  standards  (HDTV,  MPEG-2),  it  is  further 
recommended  that  Delta  continue  its  activity  with  the  ATM  Video 
Coding  Experts  Group.  For  example,  it  is  important  that  the  MPEG-2 
standard  have  the  potentiality  for  low  delay  and  high  error  resilience 
to  permit  an  easy  extension  to  H.26x. 

•  To  maximize  the  interoperability  and  applicability  of  the  future  HDTV 
system  throughout  the  Federal  government,  it  is  recommended  that 
the  selected  system  be  compatible  with  progressive  scan  and  that 
packetized  transmission  be  employed.  This  will  maximize 
compatibility  with  computerized  systems  and  future  ATM  networks. 
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2.0  STATUS  OF  HDTV  STANDARDS 


2.1  HDTV  STANDARDIZATION  IN  THE  U.S. 

In  the  United  States,  the  term  "Advanced  Television"  (ATV),  is  a  broad  term 
which  includes  the  three  categories  listed  below. 

IDTV  -  IMPROVED  DEFINITION  TELEVISION  -  The  term  Improved  Definition 
Television  refers  to  improvements  to  NTSC  television  which  remain  within  the 
general  parameters  of  NTSC  emission  standards  and,  as  such,  would  require  little 
television  receiver  modifications  and  may  include  improvements  in  encoding, 
filtering,  ghost  cancellation,  and  other  parameters  that  may  be  transmitted  and 
received  as  standard  NTSC  in  a  4:3  aspect  ratio. 

EDTV  -  EXTENDED  DEFINITION  TELEVISION  -  The  term  Extended  Definition 
Television  refers  to  a  number  of  different  improvements  that  modify  NTSC 
emissions  but  that  are  NTSC  receiver-compatible  (as  either  standard  4:3  or  "letter¬ 
box"  format).  These  changes  may  include  one  or  more  of  the  following: 

1 .  Wide  aspect  ratio. 

2.  Extended  picture  definition  at  a  level  less  than  twice  the  horizontal 
and  vertical  emitted  resolution  of  standard  NTSC. 

3.  Any  applicable  improvements  of  IDTV. 

* 

HDTV  -  HIGH  DEFINITION  TELEVISION  -  The  term  High  Definition  Television  refers 
to  television  systems  with  approximately  twice  the  horizontal  and  vertical  emitted 
resolution  of  standard  NTSC.  HDTV  systems  are  wide  aspect  ratio  systems  and 
may  include  applicable  improvements  from  IDTV  and  EDTV.  IDTV  and  EDTV 
feature  compatibility  with  the  existing  NTSC  system  which  automatically  puts 
severe  limitations  on  their  achievable  performance.  HDTV  eliminates  this 
constraint.  This  report  is  concerned  only  with  the  standardization  of  HDTV  and  its 
implementations  for  the  Federal  Government. 

The  purpose  of  this  section  is  to  provide  an  overview  of  the  HDTV 
standardization  work  in  the  U.S.  The  description  is  divided  into  the  four  parts 
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listed  below. 


2.1.1  Federal  Communication  Commission  (FCC) 

2.1.2  Advanced  Television  Systems  Committee  (ATSC) 

2.1.3  Society  of  Motion  Picture  and  Television  Engineers  (SMPTE) 

2.1.4  Proposed  Digital  HDTV  Systems 

2.1.1  Federal  Communications  Commission 

In  1941,  the  Federal  Communications  Commission  (FCC)  made  the  first 
spectrum  allocation  and  adopted  the  first  technical  standard  for  the  transmission  of 
black  and  white  television  in  the  U.S.  [1].  Forty-six  years  later,  in  response  to  a 
joint  petition  filed  by  the  Association  of  Maximum  Service  Telecasters,  Inc.  and  57 
other  broadcast  organizations  and  companies,  the  Commission  began  a  formal 
proceeding  to  replace  that  standard.  On  July  16,  1987,  the  Commission  adopted  a 
Notice  of  Inquiry  in  the  matter  of  Advanced  Television  (ATV)  Systems  and  their 
impact  on  the  existing  television  broadcast  service. 

This  notice  went  beyond  the  basic  proposal  to  accommodate  a  new 
technology  and  raised  the  idea  of  delivering  ATV  by  over-the-air  terrestrial  means, 
that  is,  by  individual  broadcast  stations.  This  was  a  bold  challenge  considering 
that  ATV  research  had  concentrated  on  satellite  and  satellite-cable  delivery 
systems,  and  many  parties  did  not  believe  that  terrestrial  broadcast  of  high 
definition  pictures  was  practical.  Nonetheless,  in  order  to  make  the  benefits  of  an 
ATV  service  available  to  all  viewers,  the  Commission  was  of  the  view  that  existing 
television  licensees  should  be  given  the  opportunity  to  provide  that  service. 

Following  the  issuance  of  the  Notice,  the  Commission  established  the 
Advisory  Committee  on  ATV  Service  (ACATS).  This  group  operates  under  a  broad 
charter  for  the  purpose  of  advising  the  Commission  on  the  facts  and  circumstances 
regarding  ATV  systems,  and  recommending  policies,  standards,  and  regulations 
that  would  facilitate  the  orderly  and  timely  introduction  of  ATV  services  in  the  U.S. 
In  its  activities,  it  may  consider  all  technical,  economic,  legal,  and  regulatory 
issues. 

The  organizational  structure  of  the  ACATS  is  illustrated  in  Figure  2.1.  The 
Planning  Subcommittee  has  the  task  of  defining  the  desirable  characteristics  of  an 
ATV  service  and  recommending  planning  factors  for  the  establishment  of  that 


2  -  2 


FCC 


CHAIRMAN  <  ALFRED  SIKES 
CHIEF  ENG.  -  TOM  STANLEY 


ADVISORY 
ON  ADV/ 
SERVICE 

COMMITTEE 

INCED  TV 
(ACATS) 

RICHARD  WILEY  -  CHAIRMAN 

SYSTEMS 

SUBCOMMITTEE 


Irwin  Dorros 


PLANNING 

SUBCOMMITTEE 


Joseph  Flaherty 


1.  ATS  SYSTEMS 
ANALYSIS 


2.  SYSTEM  EVALUATION 
AND  TESTING 


3.  ECONOMIC  ASSESSMENT 


4.  SYSTEM  STANDARD 
Robert  Hopkins 


1.  ATS  TECHNOLOGY 
ATTRIBUTES  AND 
ASSESSMENT 


2.  ATS  TESTING  AND 
EVALUATION 
SPECIFICATIONS 


3.  ATS  SPECTRUM 
UTILIZATION  AND 
ALTERNATIVES 


4.  ALTERNATIVE  MEDIA 
TECHNOLOGY  AND 
BC  INTERFACE 
CHR.  E.  Horowitz 
V.  CHR.  R.  Sanderson 


IMPLEMENTATION 

SUBCOMMITTEE 


James  Tietjen 


1.  POLICY  AND 
REGULATION 


2.  TRANSITION 
SCENARIOS 


5.  ECONOMIC  FACTORS  AND 
MARKET  PENETRATION 


6.  ATS  SYSTEMS  SUBJECTIVE 
ASSESSMENT 


7.  AUDIENCE  RESEARCH 


Figure  2.1 

ACATS  Organizational  Structure 


service.  The  Systems  Subcommittee  is  hardware  oriented;  its  task  is  to  evaluate 
ATV  systems  under  development,  recommend  a  system  (or  specify  the  design  of  a 
system),  and  advise  on  the  appropriate  technical  standards  and  spectrum 
requirements.  Working  Party  4  (Alternative  Media  Technology)  is  particularly 
important  to  the  telecommunications  community  because  its  function  is  to  insure 
that  "inoperability"  and  "extensibility"  are  properly  considered  in  the  selection  of 
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the  HDTV  standard. 

The  Implementation  subcommittee  deals  with  transition  schemes  and  provides 
advice  on  official  policies  and  regulations  needed  to  implement  an  ATV  service.  At 
the  present  time,  the  members  are  concentrating  on  spectrum  and  implementation 
issues  and  the  evaluation  of  five  proposed  ATV  systems  at  the  ATV  Test  Center. 
This  test  center  is  a  facility  established  and  funded  by  major  elements  of  the 
television  industry  for  the  purpose  of  thoroughly  measuring  the  attributes  and 
performance  of  potential  ATV  systems  under  normal  and  adverse  conditions.  The 
results  of  the  tests  will  be  given  to  the  Systems  Subcommittee  and  will  eventually 
be  incorporated  in  the  final  report  of  the  Advisory  Committee  to  the  Commission. 

In  a  report  adopted  on  August  24,  1990,  the  Commission  made  a  critical 
technical  decision  and  established  a  rough  timetable  for  the  completion  of  this 
proceeding  and  adoption  of  an  ATV  standard.  The  Commission  stated  that  it 
intends  to  select  a  simulcast  High  Definition  Television  (HDTV)  system  for  its  ATV 
standard.  Simulcast  is  a  contraction  of  simultaneous  broadcast  and  normally 
means  the  broadcast  of  one  program  over  two  channels  in  the  same  area  at  the 
same  time.  In  this  proceeding,  simulcast  has  come  to  mean  an  independent  ATV 
signal  that  can  produce  an  advanced  picture  and  be  broadcast  simultaneously  with 
conventional  television  signals  in  the  same  area  without  causing  interference. 

Consistent  with  this  position,  the  Commission  also  said  that  it  will  give  no 
further  consideration  to  systems  that  use  additional  spectrum  for  special  signals  to 
augment  conventional  television  transmissions.  That  approach  relies  on  ATV 
receivers  to  produce  an  advanced,  high-quality  picture  by  combining  the  separate 
conventional  and  augmentation  signals.  Such  a  system  would  be  less  spectrum 
efficient  and  more  difficult  to  implement  than  the  independent,  simulcast  approach. 

Finally,  the  Commission  urged  the  Advisory  Committee  to  complete  its  work 
and  submit  a  final  report  with  recommendations  by  early  1993.  The  FCC  is 
adhering  to  that  schedule.  Consequently,  most  of  the  work  by  the  ACATS 
committee  in  1 992  was  focused  on  the  evaluation  of  the  five  proposed  HDTV 
systems  which  are  summarized  below.  The  table  includes  the  dates  when  tests 
were  performed  in  1992  by  the  Advanced  Television  Test  Center  (ATTC), 
CableLabs,  and  the  ATEL  laboratory  in  Canada. 
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PROPONENT 

NAME 

ATTC/CableLabs/ATEL 

TEST  DATES* 

NHK 

Narrow  Muse 

9/30  -  2/3 

General  Instrument, 

MIT 

DigiCipher 

12/10  -  3/31 

AT&T,  Zenith 

Digital  Spectrum 
Compatible  (DSC) 

3/4  -  6/5 

NBC,  Philips, 

Sarnoff,  Thomson 

Advanced  Digital 
Television 

6/15  -  8/14 

MIT, 

General  Instrument 

Channel  Compatible 
DigiCipher  (CCDC) 

7/24  -  10/8 

*  All  completion  dates  in  1992. 

The  first  system  transmits  the  signal  in  an  analog  format  while  the  last  four 
are  all-digital.  It  is  almost  certain  that  one  of  the  all-digital  systems,  or  some 
hybrid  combination  thereof,  will  be  selected.  The  four  all-digital  systems  are 
sufficiently  important  that  they  are  briefly  described  in  Section  2.1.4. 

Key  events  during  1992  which  contributed  to  the  evaluation  of  the  five 
proposed  HDTV  systems  are  summarized  below. 

7/15  PS-WP/4  meeting  defines  requirements  for  proponents  at  9/23 

Interoperability  Review. 

9/23-25  PS-WP/4  Interoperability  Review  in  Arlington,  VA 

+  Submission  of  written  reports  by  the  proponents 
describing  their  interoperability,  extensibility,  and 
scope  of  services/features. 

-I-  Oral  presentation  of  the  reports  to  PS-WP/4  by  the 
proponents. 
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October/  Formal  Submission  to  ACATS  of  "Proposed  Improvements"  to 

November  the  proposed  HDTV  systems  by  proponents.  It  is  anticipated 

that  some  of  these  improvements  will  be  tested  in  1993. 

December  10  Final  Meeting  of  ACATS/PS/WP-4.  Edited  and  approved  the 
three  key  output  documents. 

+  Assessment  of  the  Interoperability  of  the  HDTV 
Proposals  (Appendix  A). 

+  Analysis  of  Interoperability  Evaluation  by 
Proponent  and  Review  Board  (Appendix  B). 

+  PS-WP/4  Final  Report  (Appendix  C). 

Laboratory  testing  of  the  proposed  systems  is  essentially  complete  and  the 
following  events  are  scheduled  in  the  near  future. 

February  8-12,  1993  Meeting  of  the  ACATS  Special  Panel  to  complete  the 

evaluation  of  test  data  and  make  a  recommendation  to 
ACATS. 

February  24,  1993  ACATS  committee  will  make  its  recommendation  to  the 

FCC. 

1993  Likely  testing  of  improvements  to  one  or  more  of  the 

proposed  systems  (3  weeks  each). 

1993  Field  test  of  the  selected  system. 

An  overview  of  key  conclusions  from  the  standardization  work  by  the 
ACATS  and  PS/WP4  for  1992  as  viewed  from  the  Federal  telecommunication's 
perspective  is  provided  below. 

•  There  are  particular  characteristics  of  the  proposed  HDTV  systems 
which  are  advantageous  from  the  Federal  telecommunication's 
perspective. 

+  Progressive  scan  is  advantageous  because  artifacts  are 


minimized,  compression  is  maximized,  and  the  highest  level  of 
compatibility  with  computers  is  achieved. 

+  Packetized  transmission  with  two  levels  of  priority  is 

advantageous  because  this  is  highly  compatible  with  ATM 
telecommunication  networks. 

+  Square  pixels  are  desirable  because  they  maximize  compatibility 
with  computer  systems  which  manipulate  imagery  such  as  in  a 
windows  environment. 

•  All  of  the  digital  HDTV  proposals  have  the  potential  for  a  high  level  of 
interoperability,  extensibility,  and  scalability.  The  Sarnoff  system  is 
probably  most  interoperable  because... 

+  the  coding  algorithm  is  MPEG/H.261  based, 

+  the  data  is  transmitted  in  packets  which  is  well  adapted  to 
ATM, 

+  the  packets  have  two  levels  of  priority  which  is  well  adapted  to 
ATM,  and 

+  Headers/Descriptors  are  employed  to  provide  extensibility  and 
flexibility. 

•  One  unique  characteristic  of  the  AT&T/Zenith  DSC  system  is  that  the 
DCT  coefficients  are  addressed  using  vector  quantization.  This  is 
different  from  MPEG  and  therefore  reduces  interoperability  with 
H.261. 

•  The  AT&T/Zenith  DSC  and  DigiCipher  ATVA  employ  progressive 
scanning  and  square  pixels  and  are  therefore  highly  interoperable  with 
the  computer  medium. 

•  The  ACATS  PS-WP/4  Final  Report  (Appendix  C)  properly  highlights 
the  need  for  headers/descriptors,  square  pixels,  packetized 
transmission,  and  progressive  scan. 
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2.1.2  Advanced  Television  Systems  Committee 


The  Advanced  Television  Systems  Committee  (ATSC)  was  formed  by  the 
NAB,  NTCA,  IEEE,  EIA,  and  SMPTE  to  further  the  work  on  HDTV.  It  is  a  free 
standing  committee,  and  the  organization  is  shown  in  Figure  2.2. 
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Figure  2.2 

ATSC  Organizational  Structure 


2.1.3  SMPTE 

The  SMPTE  organization  has  been  a  major  contributor  to  the  work  in  HDTV 
standards  in  recent  years.  One  major  contribution  has  been  the  development  of 
the  240M  standard  (1125  lines,  60  fields/sec.)  for  an  analog  HDTV  signal  which  is 
used  extensively  in  TV  production  studios  in  the  US  and  elsewhere.  The  key 
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parameters  for  this  standard  are  listed  in  Table  2-1 . 

Table  2-1 

240M  Key  Parameters 


IMAGE  PARAMETERS  VALUE 


TOTAL  NUMBER  LINES  1 1 25 

NUMBER  ACTIVE  LINES  1035 

FIELD  RATE  60 

INTERLACE  2:1 

ASPECT  RATIO  16:9 

BANDWIDTH  (MHz): 

LUMINANCE  Y  =  30 

COLOR  DIFFERENCE  #1  Pr  =  15 

COLOR  DIFFERENCE  #2  Pb  =  15 


Work  is  underway  on  the  development  of  a  digital  version  of  240M.  In  the 
digital  system  there  will  be  1920  samples/line. 

In  1992,  SMPTE  was  actively  involved  in  the  two  projects  listed  below 
which  are  closely  related  to  the  HDTV  standardization  process. 

Universal  Header/Descriptor  for  Imagery 
Digital  Image  Architecture 

Universal  Header/Descriotor  for  Imagery 

A  SMPTE  group  was  formed  in  June,  1991  to  investigate  the  feasibility  of  a 
universal  header/descriptor  for  imagery.  The  purpose  of  the  header  and  descriptor 
is  provided  below. 
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•  Header 

+  header  is  universally  self  Identifying 
+  header  determines  how  to  interpret  the  payload 

+  minimal  header  (kernel)  is  identifier  and  packet  length 

+  a  packet  would  typically  be  a  frame  or  group  of  frames 

•  Descriptor 

+  descriptor  provides  application  oriented  information 
+  e.g.,  picture  size,  frame  rate,  filters;  production  information; 

copyright;  scrambling;  etc. 

+  expert  groups  establish  descriptors  to  address  their  needs 

The  committee  has  completed  its  work  and  the  results  are  summarized  in  a 
report  provided  in  Appendix  D.  Since  the  work  is  very  promising,  the  SMPTE  has 
formed  a  working  group  to  develop  a  header/descriptor  standard. 

Digital  Image  Architecture 

In  April  of  1991,  SMPTE  established  a  Task  Force  on  Digital  Image 
Architecture  which  "was  charged  with  developing  and  proposing  a  structure  for  a 
hierarchy  of  digital  image  standards  that  would  facilitate  interoperation  of  image 
systems.  The  major  objective  was  to  establish  the  basis  for  image  systems  that 
are  open,  scalable  and  extensible,  thus  meeting  the  perceived  needs  for  image 
communications  in  the  environment  likely  to  exist  as  computers,  television  and 
communications  converge,  enabled  by  pervasive  digital  technology. 

The  Task  Force,  formed  from  representatives  of  the  affected  industries  and 
applications,  has  examined  the  issues,  setting  out  those  that  are  believed  critical  at 
this  time,  and  has  modelled,  for  discussion,  further  refinement  and  testing,  one 
possibie  approach  that  meets  the  basic  requirements.  It  has  also  produced 
extensive  tutorial  information  concerning  the  matters  under  consideration. 

The  key  concepts  of  the  approach  are  that  the  conditions  for  image  systems 
are: 


Open  -  the  modules  and  interfaces  forming  the  architecture  are  fully 
defined  and  in  the  public  domain. 
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•  Interoperable  -  images  and  related  equipment  may  move  freely  across 
application  and  industry  boundaries. 

Such  systems  would  be  based  on  a  hierarchy  that  is: 

•  Scalable  -  supports  a  wide  range  of  image  capabilities, 

•  Extensible  -  future-proof  to  the  extent  possible, 

•  Compatible  -  supports  existing  television  practices  and  standards 
when  possible. 

The  Task  Force  submitted  a  Final  Report  (Appendix  E)  to  SMPTE  in  late 
1992.  It  has  been  adopted,  and  a  new  group  has  been  established  to  continue  the 
work  toward  the  development  of  a  standard. 

The  report  states  the  main  objectives  of  the  Task  Force  activity  were: 

•  To  establish  the  fundamental  properties  of  image  systems, 

•  To  examine  the  technological  trends  with  a  view  to  a  prediction  of 
future  capabilities, 

•  To  arrive  at  architectural  guidelines  that  will  achieve  the  objectives  of 
interoperability,  scalability  and  extensibility. 

The  report  details  critical  issues  in  the  development  of  a  suitable  image 
architecture  such  as: 

•  The  selection  of  a  family  of  image  acquisition  rates  and  related  display 
refresh  rates  based  on  a  progression  that  permits  display  refresh  at 
integer  multiples  of  the  acquisition  rate.  Backward  compatibility  ,  if 
required,  to  the  image  acquisition  rates  currently  in  use  (24,  50, 

59.94,  and  60)  should  be  accommodated  in  the  design  of  the 
standard  modules  which  will  interface  these  existing  systems  with  the 
digital  image  architecture, 

•  Use  of  a  square  sampling  grid  as  a  simple  and  effective  common 
expression  of  images, 

•  Selection  of  analysis  and  coding  schemes  for  color  and  luminance  that 
would  allow  useful  and  effective  scaling  of  this  image  data  while 
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maintaining  a  high  level  of  interoperability, 

•  Coherent  sampling  of  the  image  based  on  the  use  of  progressive 
scanning  techniques, 

•  Use  of  headers/descriptors  to  identify  the  content  and  conform  to  the 
characteristics  of  the  data  stream. 

2.1.4  Proposed  Digital  HDTV  Systems 

The  purpose  of  this  section  is  to  briefly  describe  the  four  all-digital  HDTV 
systems  which  have  been  proposed  to  the  FCC.  Much  of  the  information  used  in 
these  descriptions  was  taken  from  documents  prepared  by  the  proponents. 
Highlights  of  the  four  systems  are  summarized  in  Table  2-2. 

2. 1.4.1  Advanced  Digital  Television  (ADTV) 

Advanced  Digital  Television  (ADTV)  is  the  digital  high-definition  television 
(HDTV)  system  developed  by  the  Advanced  Television  Research  Consortium 
(ATRC).  Figure  2.3  illustrates  an  architectural  view  of  ADTV,  which  has  been 
designed  as  a  layered  system.  Four  principal  layers  are  shown:  the  compression 
layer,  the  data  prioritization  layer,  the  transport  layer,  and  the  transmission  layer. 
The  compression  layer  performs  the  tasks  of  video  pre/post-processing  and  MPEG 
compression/decompression.  The  prioritization  layer  has  the  tasks  of  assigning 
priorities  to  video  data  at  the  encoder,  and  recombining  the  data  elements  of 
different  priority  into  coherent  data  streams  for  decompression  at  the  decoder. 

The  data  transport  layer  is  responsible  for  service-independent  data  multiplexing, 
cell  formatting,  error  detection,  error  correction,  as  well  as  service-specific  logical 
error  recovery.  The  transmission  layer  performs  the  tasks  of  modulation,  channel 
equalization,  and  frequency  translation. 

Video  compression  in  ADTV  is  based  on  the  MPEG  standard.  While  MPEG 
compression  provides  good  image  quality  in  the  bit-rate  regime  of  interest  (i.e.,  15- 
20  Mbps  for  HDTV  resolution  images),  the  particular  robustness,  co-channel 
immunity  and  service  flexibility  requirements  of  the  broadcast  application  required 
substantial  augmentation.  The  approach  developed  by  the  ATRC  is  called 
"MPEG -I-  -I-",  In  which  an  MPEG-compatible  video  representation  is  encapsulated 
with  additional  broadcast-specific  prioritization  and  transport  layers.  A  high  degree 
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Table  2-2 

Attributes,  Characteristics,  and  Processes  of 
Digital  HDTV  Terrestrial  Broadcasting  Systems 


System  Name 

OigiCipher 

DSC-HDTV 
(Digital  Spectrum 
Compatible) 

ADTV 

(Advanced  Digital 
Television) 

CCDC-HDTV 

(Channel 

Compatible 

DigiCipher) 

Proponent 

General  Instrument, 

MIT 

AT&T,  Zenith 

NBC,  Philips, 
Sarnoff,  Thomson 

MIT, 

General  Instrument 

Lines  per  Frame 

1,050 

787/788 

1,050 

787/788 

Frames  per  Second 

29.97 

59.94 

29.97 

59.94 

Interlace 

2:1 

1:1 

2:1 

1:1 

Horizontal  Scan  Rate 

31.469kHz 

47.203kHz 

31.469kHz 

47.203kH2 

Aspect  Ratio 

16:9 

16:9 

16:9 

16:9 

Active  Video  Pixels 

1,408Hx960V  (luma) 

352H  X  480V  (chroma) 

1,280H  X  720V  (luma) 

640H  X  360V  (chroma) 

1,440H  x  960V  (luma) 

720H  X  480V  (chroma) 

1,280Hx  720V  (luma) 

640H  x  360V  (chroma) 

Pixel  Aspect  Ratio 

33:40 

1:1 

27:32 

1:1 

Bandwidth 

21.5MHz  (luma) 

5.4MHz  (chroma) 

34MH2  (luma) 

17MH2  (chroma) 

24.5MHz  (luma) 

12.25MHz  (chroma) 

34MHz  (luma) 

17MHz  (chroma) 

Colorimetry 

SMPTE  240M 

SMPTE  240M 

SMPTE  240M 

SMPTE  240M  (approx.) 

Video  Compression 
Algorithm 

Motion-compensated 

DCT  coding 

Motion-compensated 
transform  coding 

(DCT  &  VQ) 

Motion-compensated 

DCT  coding  (MPEG- 

based) 

Motion-compensated 

DCT  coding 

Block  Size 

8x8 

8x8 

8x8 

8x8 

Sampling  Frequency 

53.65MHz 

75.3MHz 

54MHz 

75.5MHz 

Audio  Bandwidth 

20kHz 

20kHz 

23kHz 

24kHz 

Audio  Compression 
Algorithm 

AC-2 

AC-2 

MUSICAM 

MIT-AC 

Audio  Sampling 
Frequency 

48kHz 

47.203kHz 

48kHz 

48kHz 

Dynamic  Range 

85dB 

96dB 

96dB 

lOOdb 

Number  of  Audio 

Channels 

4 

4 

4 

4 
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System  Name 

DigiCipher 

DSC-HDTV 
(Digital  Spectrum 
Compatible) 

ADTV 

(Advanced  Digital 
Television) 

CCDC-HDTV 

(Channel 

Compatible 

DigiCipher) 

Video  Data  Rate 

12.59Mbits/s  (16  QAM) 

17.49Mbits/s  (32  QAM) 

Automatically  varies 

from  8.6  to  17.1  Mbits/s 

17.73Mbits/s 

13.6  (16  QAM) 

18.88  (32  QAM) 

Audio  Data  Rate 

0.503Mbits/s 

0.5Mbits/s 

0.512Mbits/s  (nominal) 

0.755Mbits/s 

Control  Data 

126kbit8/s 

40kbits/s  (spare) 

40kbits/s  (data) 

126kbits/s 

Ancillary 

1 26kbits/s 

413kbits/s 

256kbits/s  (nominal) 

126kbits/s 

Sync 

N/A 

292  to  544kbits/s 

N/A 

N/A 

Total  Data 

19.51  Mbits/s  (16QAM) 

24.39Mbits/s  (32QAM) 

11.1  to  21.0Mbits/s 

24Mbits/s  (nominal) 

21.5Mbits/s  (16QAM) 

26.43Mbits/s  (32QAM) 

Error  Correction 

Overhead 

6.17Mbits/s 

1 .3  to  2.4Mbits/s 

5.5Mbits/s 

6.54Mblts/s 

RF  Modulation 

(Terrestrial) 

16QAMor32QAM 

2“level  and  4-level  VSB 

Spectrally  shaped  QAM 

16QAM  or  32QAM 

3dB  Bandwidth 

(Terrestrial) 

4.88MHz 

5.38MHz 

5.28MHz 

5.28MHz 

C/N  Threshold 

(Terrestrial) 

12.5dB  (16QAM) 

16.5dB  (32QAM) 

1  6dB  (4-level  data) 

lOdB  (2-ievet  data) 

16.1dB  (High  priority 

channel) 

ll.ldB  (Standard 

Priority  Level) 

11.7  (16QAM) 

15.7  (32QAM) 

Channel  Equalization 
(Ghost  Cancelling) 

-2  to  +  24  ms  (multiple 
ghosts) 

-2  to  -I-  20  ms  (multiple 
ghosts) 

1 6  ms  (may  be  extended 

to  40  ms) 

-4  to  +4  ms  (prototype 

hardware) 

-2  to  +  24  ms  (multiple 
ghosts) 
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of  robustness  in  the  presence  of  channel  impairments  is  achieved  by  first 
prioritizing  MPEG  encoded  video  data  into  High-Priority  and  Standard-Priority  data 
streams,  each  of  which  is  packaged  into  a  rugged  (and  flexible)  fixed-length  "cell 
relay"  type  data  format. 

The  Prioritized  Data  Transport  format  used  in  ADTV  provides  for  reliable 
transport  and  synchronization  of  MPEG  -I-  -I-  data  over  a  two-tier  physical 
transmission  system.  It  uses  a  Reed-Solomon  error  correction  code  to  fully  correct 
the  bit  errors  that  can  occur  under  moderate  noise  and  interference  conditions. 

Data  interleaving  ensures  that  even  burst  errors  (which  can  be  caused  by  impulse 
noise)  can  be  fully  corrected.  The  Prioritized  Data  Transport  format  also  provides 
error  detection  capability,  coupled  with  several  transport  features  that  support 
rapid  video  decoder  recovery  after  uncorrectable  channel  errors.  Such  events  will 
inevitably  occur  in  a  terrestrial  broadcast  channel  during  severely  impaired 
transmission,  and  must  be  handled  gracefully  by  a  receiver. 

The  high-  and  standard-priority  MPEG+  -I-  bit-streams  are  sent  over  the 
simulcast  channel  with  appropriate  transmission  priority  using  a  two-tier 
modulation  technique  which  delivers  approximately  20%  of  the  data  (i.e.,  the  high 
priority  information)  at  a  substantially  improved  reliability  level.  In  addition  to 
providing  two-tier  transmission  matched  to  the  video  prioritization  process,  ADTV's 
two-carrier  SS-QAM  approach  mitigates  the  effect  of  interference  from  NTSC  co¬ 
channel,  thus  permitting  the  low-power  operation  necessary  for  simulcast.  Finally, 
reliable  demodulation  of  the  low-power  ADTV  signal  is  achieved  without  sacrificing 
bandwidth  efficiency  through  the  use  of  an  appropriate  high-rate  trellis  code  within 
the  QAM  modem.  At  a  systems  level,  MPEG+  +  compression  and  Prioritized  Data 
Transport,  together  with  ATRC's  two-tier  SS-QAM  modulation  technique,  provide 
the  combination  of  image  quality,  robustness,  co-channel  immunity  and  flexibility 
necessary  for  the  digital  simulcast  application. 

2. 1.4.2  Channel-Compatible  DigiCipher  (CCDC) 

The  Channel-Compatible  DigiCipher  (CCDC)  HDTV  system  is  proposed  by 
the  Massachusetts  Institute  of  Technology  and  General  Instrument  Corporation.  A 
high  resolution  progressively  scanned  baseline  video  signal  of  1280  x  720  picture 
elements,  59.94  fps,  and  16:9  aspect  ratio,  can  be  transmitted  within  a  single  6 
MHz  channel.  The  CCDC  system  is  source-adaptive.  The  system  can  recognize 
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and  adapt  itself  to  the  particular  characteristics  of  the  source  format  so  that  the 
highest  quality  video  can  be  reconstructed.  The  system,  for  example, 
accommodates  different  frame  rates.  The  system  automatically  optimizes  both  the 
encoding  and  decoding  processes  so  that  the  highest  quality  video  can  be  obtained 
for  each  source  format.  The  system  will  support,  for  example,  a  wide  range  of 
source  adaptivity  including  different  frame  rates,  resolutions,  aspect  ratios, 
progressive/interlaced  scanning,  and  the  black-and-white/color  characteristics  of 
the  video. 

The  system  is  also  scalable,  in  that  performance  subsets  can  be  extracted 
from  the  CCDC  HDTV  signal.  With  proper  design  of  the  receiver,  the  received 
video  may  be  displayed  at  multiple  resolutions  with  modest  amounts  of  signal 
processing.  As  an  example,  lowpass  filtering  and  subsampling  the  signal  is  an 
effective  method  to  obtain  a  lower  spatial  resolution  image.  In  addition, 
row/column  doubling  methods  can  be  used  to  increase  the  spatial  resolution. 

A  block  diagram  of  the  CCDC  HDTV  system  is  shown  in  Figure  2.4.  The 
system  consists  of  three  parts:  video  coding,  audio  coding  and  transmission.  By 
coding,  we  mean  to  remove  redundancy  in  the  information  source  and  to  transmit 
only  the  essential  information. 

A  transmission  or  digital  communications  systems  is  required  to  link  the 
digital  outputs  of  the  various  source  coders  to  an  analog  RF  channel.  In  a  good 
design,  the  transmission  system  isolates  the  source  coder  from  the  channel  and  the 
two  problems  of  representing  the  HDTV  signal  efficiently  and  transporting  it  from 
broadcaster  to  user  become  largely  independent.  The  transmission  system  for  a 
given  medium  (terrestrial  broadcast,  cable,  satellite,  videotape,  etc.)  must  account 
for  imperfections  and  noise  in  the  channel  and  deliver  a  stream  of  bits  as  reliably  as 
possible. 

In  the  case  of  video  source  coding,  there  is  redundancy  along  all  three 
dimensions  of  the  signal.  Temporal  redundancy  is  removed  by  a  motion- 
compensated  predictive  coding  scheme  in  which  the  motion  between  the  previous 
and  current  frames  is  estimated,  a  predicted  current  frame  is  computed  based  on 
the  previous  frame  and  the  extrapolated  motion,  and  only  the  new  information  is 
sent  to  the  spatial  coder. 

Spatial  redundancy  arises  from  objects  which  have  extent  and  texture;  these 
introduce  correlations  between  neighboring  pixels.  The  correlations  are  removed 
using  the  Discrete  Cosine  Transform  (DCT).  The  DCT  coefficients  are  then 
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weighted  to  reflect  the  relative  importance  of  different  frequency  components,  and 
coefficients  which  are  perceptually  important  are  quantized  and  entropy  coded. 

In  audio  coding,  the  input  signal  is  divided  into  short  overlapping  segments, 
then  windowed  and  transformed.  These  transform  coefficients  are  then  divided 
approximately  into  the  critical  bands  of  the  human  auditory  system. 

Psychoacoustic  models  are  used  to  eliminate  inaudible  information  in  the  signal 
spectrum.  Spectral  envelope  information  is  encoded  using  a  fixed  bit  allocation 
strategy,  and  this  envelope  is  used  for  dynamic  bit  allocation  for  the  transform 
coefficients. 

The  encoded  video  data,  audio  data  and  auxiliary  data  are  multiplexed  into  a 
single  bit  stream  for  the  transmission  system.  Block  and  trellis  encoding  are  used 
to  reintroduce  some  redundancy  to  combat  channel  noise,  and  the  resulting  digital 
symbols  are  transformed  into  analog  waveforms  through  quadrature  modulation. 
The  system  can  operate  at  32  QAM  (preferred  mode)  or  16  QAM,  depending  on 
the  desired  noise  threshold/coverage  area. 

2.1. 4.3  Digital  Spectrum  Compatible  (DSC)  HDTV  System 

Zenith  and  AT&T  have  developed  the  all-digital  DSC  high-definition  television 
(HDTV)  simulcast  system.  The  system's  787.5-line  progressive  scanning  format 
eliminates  artifacts  of  interlaced  systems,  provides  full  motion  rendition  (critical  for 
sports  and  other  fast-action  programming)  and  promotes  HDTV  compatibility  with 
current  and  future  computer  and  digital  communications  technologies.  Progressive 
scanning  plus  square  pixels  and  unique  compression  technology  make  the  DSC- 
HDTV  system  well  suited  for  interconnectivity,  extensibility,  scalability  and  other 
computer  related  considerations. 

The  DSC-HDTV  system  employs  a  four-level  vestigial  sideband  (4-VSB) 
modulation  technique  which  is  complemented  by  a  two-level  digital  data  system  (2- 
VSB)  expanding  the  service  area  of  the  Digital  Spectrum  Compatible  signal.  The 
resulting  bi-rate  coding  system  identifies  and  selects  the  most  important  picture 
information  on  a  scene-by-scene  basis  and  automatically  transmits  that  data  in  a 
two-level  (binary)  mode.  The  remainder  of  the  picture  information  is  transmitted  as 
4-level  digital  data.  Two-level  digital  coding  makes  the  system  far  more  tolerant  of 
noise  and  other  interference  at  greater  distances  from  the  transmitter.  This  allows 
extended  reception  of  the  signal  beyond  the  traditional  NTSC  service  area  and 


2  -  19 


eliminates  the  so-called  "cliff  effect",  or  complete  and  abrupt  loss  of  picture, 
associated  with  some  other  all-digital  approaches. 

Picture  Format 

Digital  Spectrum-Compatible  High-Definition  Television  (DSC-HDTV)  images 
are  progressively  scanned  at  787.5  lines/frame,  59.94  frames/second.  The  aspect 
ratio  is  16:9  and  the  horizontal  line  rate  is  47.203  kHz,  three  times  that  of  NTSC. 
The  nominal  video  baseband  signal  bandwidth  is  34  KHz.  (With  a  Kell  factor  of 
0.9  for  sampling  in  the  horizontal  direction  and  an  approximate  sampling  rate  of 
75.3  MHz,  the  nominal  video  bandwidth  equals  0.9  x  75.3/2  =  33.9  MHz.) 

Transmitter  and  receiver  signal  processing  are  performed  on  square  pixels  in 
a  720  line  by  1280  pixel  array.  In  the  studio,  an  additional  guard  band  of  pixels  at 
all  four  edges  is  provided  to  allow  for  transient  effects  of  processing,  analog  rise 
times,  production  related  edge  effects  and  timing  tolerances. 

Square  pixels  are  chosen  to  facilitate  computer  interface  and  special  effects 
processing,  the  particular  numbers  offer  easy  conversion  to/from  NTSC  for 
simulcast  purposes.  Conversion  to  525  line  CCIR  601  requires  only  a  4:3 
interpolation  horizontally  and  3:2  vertically.  In  addition,  the  simple  relationship  to 
NTSC  provides  economical  means  for  designing  dual  purpose  HDTV/NTSC 
receivers. 

Progressive  scanning  has  advantages  in  video  compression  and  display  of 
motion  without  line  pairing  or  resolution  loss.  The  1280  by  720  format  is  simply 
related  to  Common  Image  format  by  a  linear  factor  of  2:3  and  is  easily  extensible 
to  higher-line-number  progressive  formats  when  they  become  practicable.  The 
data  rate  generated  by  this  format  is  within  the  capability  of  current  commercial 
high  definition  tape  recorder  technology. 

Video  Coding 

The  Video  Encoder  accepts  RGB  signals  with  SMPTE  240M  colorimetry  and 
transfer  function.  The  video  encoder  is  shown  in  Figure  2.5.  Motion  from  frame 
to  frame  is  estimated  using  a  hierarchical  block-matching  Motion  Estimator.  The 
Motion  Estimator  produces  motion  vectors,  which  are  compressed  and  sent  to  the 
output  buffer  for  transmission.  The  output  buffer  has  an  output  rate  varying 
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between  9  and  17  Mb/s  and  has  a  varying  input  rate  that  depends  on  the  image 
content.  The  buffer  history  is  used  to  control  the  coding  parameters  so  that  the 
average  input  rate  equals  the  average  output  rate.  The  feedback  mechanism 
involves  adjustment  of  the  allowable  distortion  level,  since  increasing  the  distortion 
level  (for  a  given  image  or  image  sequence)  causes  the  Encoder  to  produce  a  lower 
output  bit  rate. 

2. 1.4.4  DlgiCIpher™  HDTV  System 

The  DigiCipher™  HDTV  system  has  been  proposed  by  General  Instrument  for 
the  American  Television  Alliance.  Figure  2.6  shows  the  block  diagram  of  the 
encoder.  The  digital  video  encoder  accepts  YUV  inputs  with  16:9  aspect  ratio  and 
1050-line  interlace  (1050/2:1)  at  a  59.94  field  rate.  The  YUV  signals  are  obtained 
from  analog  RGB  inputs  by  low  pass  filtering,  A/D  conversion,  and  an  RGB-to-YUV 
matrix.  The  sampling  frequency  is  53.65  MHz  for  R,G,  and  B.  The  digital  video 
encoder  implements  the  compression  algorithm  and  generates  a  video  data  stream. 
The  digital  audio  encoder  accepts  four  audio  inputs  and  generates  an  audio  date 
stream.  The  data/text  processor  accepts  four  data  channels  at  9600  baud  and 
generates  a  data  stream.  The  control  channel  processor  interfaces  with  the  control 
computer  and  generates  control  data  stream. 

The  multiplexer  combines  the  various  data  streams  into  one  data  stream  at 
18.22  Mbps.  The  FEC  encoder  adds  error  correction  overhead  bits  and  provide 
24.39  Mbps  of  data  to  the  32-QAM  modulator.  The  symbol  rate  of  the  32-QAM 
signal  is  4.88  MHz. 

The  DigiCipher™  HDTV  system  can  also  support  1 6-QAM  transmission  that 
provides  lower  system  threshold  with  a  slight  penalty  in  picture  quality.  The  lower 
system  threshold  can  be  used  to  improve  the  ATV  coverage  area  and/or  to  reduce 
the  station  spacing.  Through  the  use  of  a  unique  design,  the  DigiCipher™  system 
shares  the  same  digital  video/audio  data  processing,  forward  error  correction,  and 
QAM  modulation/demodulation  circuities  between  the  two  operating  modes.  Table 
2-3  summarizes  the  key  parameters  of  the  DigiCipher™  HDTV  system. 

The  video  compression  process  can  be  broken  down  into  the  following 
different  subprocesses: 
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Figure  2.6 

Encoder  Block  Diagram 
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Table  2-3 

DigiCipher™  System  Parameters 


Operating  Mode 

1 6-OAM 

32-QAM 

VIDEO 

Raster  Format 

1050/2:1  Interlaced 

1050/2:1  Interlaced 

Aspect  Ratio 

16:9 

16:9 

Frame  Rate 

Bandwidth 

29.97  Hz 

29.97  Hz 

Luminance 

21.5  MHz 

21.5  MHz 

Chrominance 

5.4  MHz 

5.4  MHz 

Active  Pixels 

Luminance 

960(V)  X  1408(H) 

960(V)  X  1408(H) 

Chrominance 

480(V)  X  352(H) 

480(V)  X  352(H) 

Horizontal  Resolution 

Static 

660  Lines  per  Picture  Height 

660  Lines  per  Picture  Height 

Dynamic 

660  Lines  per  Picture  Height 

660  Lines  per  Picture  Height 

Sampling  Frequency 

53.65  MHz 

53.65  MHz 

Colorimetry 

Horizontal  Line  Time 

SMPTE  240M 

SMPTE  240M 

Active 

26.24  usee 

26.24  usee 

Blanking 

5.54  usee 

5.54  usee 

AUDIO 

Number  of  Channels 

4 

4 

Bandwidth 

20  kHz 

20  kHz 

Sampling  Frequency 

47.2  kHz 

47.2  kHz 

Dynamic  Range 

90  dB 

90  dB 

DATA 

Video  Data 

12.59  Mbps 

17.47  Mbps 

Audio  Data 

503  kbps 

503  kbps 

Async  Data  and  Text 

126  kbps 

1 26  kbps 

Control  Channel  Data 

126  kbps 

1 26  kbps 

Total  Data  Rate 

13.34  Mbps 

18.22  Mbps 

TRANSMISSION 

FEC  Data 

6.17  Mbps 

6.17  Mbps 

DATA  Transmission  Rate 

19.51  Mbps 

24.39  Mbps 

QAM  Symbol  Rate 

4.88  MHz 

4.88  MHz 

Adaptive  Equalizer  Range 

-2  to  24  usee 

-2  to  24  usee 

SYSTEM  THRESHOLD 
Noise  (C/N) 

12.5  dB 

16.5  dB 

ATV  Interference  (C/I) 

12.0  dB 

16.0  dB 

NTSC  Interference  (C/l) 

0.0  dB 

5.0  dB 

NOTE:  The  differences  in  the  two  operating  modes  are  emphasized  in  bold  letters. 
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1 .  A/D  Conversion  and  RGB-to-YUV  Matrix 

2.  Chrominance  Preprocessor 

3.  Discrete  Cosine  Transform  (DCT) 

4.  Coefficient  Quantization 

5.  Huffman  (Variable  Length)  Coding 

6.  Motion  Estimation  and  Compensation 

7.  Integration  of  Motion  Compensation  with  Intraframe  Coding 

8.  Adaptive  Field/Frame  Processing 

9.  Motion  Picture  Processing 

1 0.  Rate  Buffer  Control 

A  block  diagram  for  the  encoder  video  processing  is  shown  in  Figure  2.7. 
2.2  INTERNATIONAL  HDTV  STANDARDS  ACTIVITY 

In  1992  there  was  great  interest  and  activity  in  HDTV  around  the  world.  In 
Japan,  the  MUSE  (Multiple  Sub-Nyquit  Encoding)  system  continues  to  be 
operational  as  an  official  service  offered  to  the  public.  In  Europe,  the  MAC 
(Multiplexed  Analog  Components)  system  continues  to  be  evaluated  on  an 
experimental  basis.  Both  of  these  systems  employ  analog  transmission  techniques 
as  opposed  to  the  all-digital  approach  likely  to  be  selected  by  the  U.S.  During 
1992,  the  world  has  begun  to  believe  that  the  all-digital  approach  will  be  the 
ultimate  winner  and  has  stepped  up  their  research,  development,  and  testing  work 
in  the  all-digital  HDTV  area. 

During  1992,  there  was  considerable  activity  by  the  CCIR  toward  the 
development  of  an  international  all-digital  HDTV  standard.  Special  Rapporteur 
Group  II  (User  Requirements)  prepared  a  key  document  entitled  "Statement 
Concerning  User  Requirements  for  Source  Coding  and  Multiplexing  for  Broadcast 
Applications"  which  is  included  in  Appendix  F.  A  few  key  statements  from  this 
document  are  provided  below. 

1 .  It  is  widely  recognized  today  that  broadcasting  is  in  the  middle  of  a 
revolution  leading  towards  the  "all-digital"  broadcast  system  of  the 
future. 

2.  It  is  appropriate  at  this  time  to  draw  attention  to  the  great  interest  in 
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Figure  2.7 

Digital  Video  Encoder  Block  Diagram 
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systems  able  to  support  hierarchical  source  and  channel  coding  to 
provide  HDTV,  EDTV  and  CDTV  (STV)  transmission  and  receiver 
options  for  a  broadcast  signal  with  transmission  bit  rates  in  the  range 
20-30  Mbit/s. 

3.  In  recognition  of  the  rapid  time-scale  envisaged  for  standardization 
within  MPEG,  the  Special  Rapporteur  Group  is  charged  with  the 
production  of  its  first  Report  concerning  the  requirements  for 
broadcasting  and  secondary  distribution  in  September,  1992. 

There  are  three  Task  Groups  within  the  CCIR  that  are  working  on  HDTV 
standards. 

•  Task  Group  11/1  -  Developing  a  standard  for  HDTV  production  and 
computer  processing. 

•  Task  Group  11/3  -  Working  on  a  digital  standard  for  terrestrial 
broadcast.  Stan  Baron,  of  NBC,  is  the  Chairman. 

•  Task  Group  11/4  -  The  purpose  of  this  group  is  to  harmonize  HDTV 
work  within,  and  outside  the  CCIR.  This  group  met  in  Washington  in 
October.  To  achieve  harmony,  the  group  advocates  common  source 
algorithms,  hierarchical  coding,  and  headers/descriptors.  This  concept 
is  outlined  in  a  technical  contribution  which  is  included  in  Appendix  G. 
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3.0  STATUS  OF  TELECOMMUNICATION  STANDARDS  RELATED  TO  HDTV 


There  are  four  digital  TV  telecommunication  standards  that  are  closely 
related  to  HDTV.  These  are: 

•  H.261 

•  MPEG1 

•  H.26X 

•  MPEG2 

H.261  was  the  first  to  be  developed.  It  was  designed  for  interactive 
teleconferencing  at  bit  rates  from  64  Kbps  to  2  Mbps.  MPEG1  built  on  many  of 
the  ideas  of  H.261,  and  added  some  new  ones.  Its  primary  focus  was  VHS  quality 
video  from  digital  storage  media  at  about  1.5  Mbps,  although  a  wide  range  of 
quality  and  bit  rates  are  allowed.  MPEG2  is  an  advanced  version  of  MPEG1  with 
many  additional  features.  It  is  designed  for  higher  bit  rates  (4  and  9  Mbps  were 
used  in  simulations)  and  very  high  quality  CCIR  601  resolution.  H.26X  shares  a 
common  text  with  MPEG2,  but  selects  parameters  and  modes  of  operation  that  are 
suitable  for  interactive  communications.  It  is  designed  for  use  with  BISDN 
networks,  using  ATM  access  and  possibly  variable  bit  rates. 

The  following  sections  describe  these  standards  in  some  detail,  and  indicate 
their  relationship  to  each  other  and  to  the  proposed  HDTV  standards. 

3.1  CCITT  RECOMMENDATIONS 

3.1.1  H.261 

In  December  1990,  the  CCITT  finalized  a  set  of  five  Recommendations 
{H.261,  H.221,  H.242,  H.230,  and  H.320)  which  collectively  define  an  audiovisual 
terminal  to  provide  video  teleconferencing  (VTC)  and  video  telephony  (VT)  services 
over  the  Integrated  Services  Data  Network  (ISDN).  Since  the  basic  building  block 
of  the  ISDN  is  a  basic  channel  (BRI)  operating  at  64  Kbps,  the  generic  term  "P  x  64 
Kbps"  refers  to  operation  of  this  terminal  at  integral  values  of  P  up  to  a  maximum 
of  30. 

H.261  is  entitled  "Video  Codec  for  Audiovisual  Services  at  P  x  64  Kbps"  and 
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fully  defines  the  video  codec  function  for  the  P  x  64  audiovisual  terminal  including 
picture  format,  video  coding  algorithm  and  forward  error  control.  Figure  3.1  is  a 
functional  block  diagram  of  the  H.261  video  codec.  The  heart  of  the  system  is  the 
source  coder  which  compresses  the  incoming  video  signal  by  reducing  redundancy 
inherent  in  the  TV  signal.  The  multiplexer  combines  the  compressed  data  with 
various  side  information  which  indicates  alternative  modes  of  operation.  A 
transmission  buffer  is  employed  to  smooth  the  varying  bit  rate  from  the  source 
encoder  to  adapt  it  for  the  fixed  bit  rate  communication  channel.  A  transmission 
coder  includes  functions  such  as  forward  error  control  to  prepare  the  signal  for  the 
data  link. 


Figure  3.1 

Block  Diagram  of  the  Video  Codec 


Since  H.261  is  a  true  international  standard,  there  is  a  basic  challenge  to 
reconcile  the  incompatibility  between  European  TV  standards  (PAL,  SECAM)  and 
those  in  most  other  areas  of  the  world  (NTSC).  PAL  and  SECAM  employ  625  lines 
and  a  50  Hz  field  rate  while  NTSC  has  525  lines  and  a  60  Hz  field  rate.  This 
conflict  was  solved  by  adopting  a  Common  Intermediate  Format  (CIF)  and  QCIF 
(Quarter  CIF)  as  the  picture  structure  which  must  be  employed  for  any 
transmission  adhering  to  H.261.  The  CIF  and  QCIF  parameters  are  defined  below. 
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CIF 

QCIF 

Coded  Pictures  per  second 

29.97  (or  integral 

submultiples) 

Coded  Luminance  pixels  per  line 

352 

176 

Coded  Luminance  lines  per  picture 

288 

144 

Coded  Color  pixels  per  line 

176 

88 

Coded  Color  lines  per  picture 

144 

72 

The  QCIF  format,  which  employs  half  the  CIF  spatial  resolution  in  both 
horizontal  and  vertical  directions,  is  the  mandatory  H.261  format:  full  CIF  is 
optional.  It  is  anticipated  that  QCIF  will  be  used  for  videophone  applications  where 
head-and-shoulders  pictures  are  sent  from  desk  to  desk.  Conversely,  it  is  assumed 
that  the  full  CIF  format  will  be  used  for  teleconferencing  where  several  people 
must  be  viewed  in  a  conference  room. 

Figure  3.2  is  a  functional  block  diagram  outlining  the  H.261  source  coder. 
Interframe  prediction  is  first  carried  out  in  the  pixel  domain.  The  prediction  errors 
are  encoded  by  the  Discrete  Cosine  Transform  using  blocks  of  8  pels  x  8  pels.  The 
Transform  coefficients  are  next  quantized  and  fed  to  the  multiplexer.  Motion 
compensation  is  included  in  the  prediction  on  an  optional  basis. 


TO  VIDEO 

MULTIPLEX 

CODER 


Figure  3.2  Source  Coder 
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Picture  Structure 


In  the  encoding  process,  the  picture  is  subdivided  into  macroblocks  where 
each  macroblock  consists  of  a  2  x  2  array  of  DCT  blocks  (see  Figure  3.3).  The 
macroblock  header  defines  the  type  of  coding  to  be  performed,  possible  motion 
vectors,  and  which  blocks  within  the  macroblock  will  actually  be  coded.  There  are 
two  basic  types  of  coding.  In  Intra  coding,  coding  is  performed  without  reference 
to  previous  pictures.  This  mode  is  relatively  rare,  but  is  required  for  forced 
updating,  and  every  macroblock  must  occasionally  be  Intra  coded  to  control  the 
accumulation  of  inverse  transform  mismatch  error.  The  more  common  coding  type 
is  Inter,  in  which  only  the  difference  between  the  previous  picture  and  the  current 
one  is  coded.  Of  course,  for  picture  areas  without  motion,  the  macroblock  does 
not  have  to  be  coded  at  all. 


Luminance  Blue  Red 

Figure  3.3 

Arrangement  of  Blocks  in  a  Macroblock 


Example  of  Block  Coding 

Figure  3.4  shows  a  simple  example  of  how  each  8x8  block  is  coded.  In 
this  case,  Intra  coding  is  used,  but  the  principle  is  the  same  for  Inter  coding. 

Figure  3.4a  shows  the  original  block  to  be  coded.  Without  compression,  this 
would  take  8  bits  to  code  each  of  the  64  pixels,  or  a  total  of  51 2  bits.  First,  the 
block  is  transformed,  using  the  two-dimensional  Discrete  Cosine  Transform  (DCT), 
giving  the  coefficients  of  Figure  3.4b.  Note  that  most  of  the  energy  is 
concentrated  into  the  upper  left-hand  corner  of  the  coefficient  matrix.  Next,  the 
coefficients  of  Figure  3.4b  are  quantized  with  a  step  size  of  6.  (The  first  term 
{DC}  always  uses  a  step  size  of  8.)  This  produces  the  values  of  Figure  3.4c, 
which  are  much  smaller  in  magnitude  than  the  original  coefficients  and  most  of  the 
coefficients  become  zero.  The  larger  the  step  size,  the  smaller  the  values 
produced,  resulting  in  more  compression. 


3  -  4 


75  76  77  78  79  80  81  82 

77787980  81  828384 

79  80  81  82  83  84  85  86 

81  82  83  84  85  86  87  88 

83  84  85  86  87  88  89  90 

85  86  87  88  89  90  91  92 

87  88  89  90  91  92  93  94 

89  90  91  92  93  94  95  96 


76  76  77  79  80  81  82  83 

77  77  78  80  81  82  83  84 

79  79  80  81  83  84  85  86 

81  82  83  84  85  87  88  88 

84  84  85  87  88  89  90  91 

86  87  88  89  91  92  93  93 

88  89  90  91  92  94  95  95 

89  90  91  92  93  95  96  96 


f)  RECONSTITUTED  BLOCK 


RUN  LEVEL  CODE 

0  86  01010110 

0  -3  001011 

0  *6  001000011 

EOB  10 

TOTAL  CODE  LENGTH  »  25 

d)  COEFFICIENTS  IN  ZIG-ZAG 
ORDER  AND  VARIABLE  LENGTH 
CODED 


Figure  3.4 

Sample  Block  Coding 


The  coefficients  are  then  reordered,  using  the  Zig-Zag  scanning  order  of 
Figure  3.5.  All  zero  coefficients  are  replaced  with  a  count  of  the  number  of  zero's 
before  each  non-zero  coefficient  (RUN).  Each  combination  of  RUN  and  VALUE 
produces  a  Variable  Length  Code  (VLC)  that  is  sent  to  the  decoder.  The  last  non¬ 
zero  VALUE  is  followed  by  an  End  of  Block  (EOB)  code.  The  total  number  of  bits 
used  to  describe  the  block  is  25,  a  compression  of  20:1 . 


3.1.2  H.26X 


In  1990,  CCITT  Study  Group  XV  established  the  "Experts  Group  for  ATM 
Video  Coding”  to  develop  a  Recommendation  for  a  video  codec  for  operation  over 
future  ATM  BISON  networks.  The  group  has  met  nine  times  and  has  worked  very 
effectively  to  achieve  its  objective.  The  work  of  the  Experts  Group  has  been 
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Figure  3.5 

Scanning  Order  in  a  Block 


tightly  coordinated  with  the  ISO  MPEG  Group  {ITC1/WG1 1)  as  they  work  on  the 
MPEG2  standard.  The  integration  of  these  activities  has  been  accomplished  by  the 
same  personnel  meeting  as  an  ATM  Experts  Group  but  also  as  members  of  the 
MPEG  group.  This  process  has  proven  to  be  successful  because  the  basic 
approach  to  MPEG2  has  the  following  desirable  characteristics. 

•  There  is  a  low  delay  mode  where  B  frames  are  not  employed, 

•  there  is  much  attention  given  to  the  coding  algorithm  to  insure  a  high 
degree  of  error  resilience  which  occurs  more  in  a  telecommunication 
environment, 

•  the  MPEG2  algorithm  is  an  extension  of  the  H.261/MPEG1  algorithm 
which  makes  it  possible  to  achieve  backward  compatibility  at  a 
reasonable  cost. 

Another  comment  on  the  standardization  of  H.26X  are  provided  below. 

•  H.261  has  a  rigidly  defined  picture  format  --  QCIF,  FCIF.  It  is  likely 
that  H.26X  will  be  less  rigid  --  i.e.,  a  number  of  formats  will  be 
defined  as  options. 
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3.2  ISO  STANDARDS 


The  International  Standards  Organization  (ISO)  develops  standards  in  many 
areas,  but  is  primarily  concerned  with  computer  rather  than  communications 
issues.  It  set  up  a  working  group  in  1988  to  develop  a  standard  for  storing  and 
retrieving  video  sequences.  This  group  is  officially  ISO-IEC  JTC1/SC2/WG11,  but 
is  more  commonly  known  as  Moving  Picture  Experts  Group  (MPEG).  Other  related 
groups  are  Joint  Photographic  Experts  Group  (JPEG)  and  Joint  Bi-level  Image 
Experts  Group  (JBIG). 

3.2.1  MPEG1 

This  is  officially  International  Standard  (IS)  11172,  "Coding  of  moving 
picture  and  associated  audio-for  digital  storage  media  at  up  to  about  1.5  Mbit/s", 
Rev  2,  March  27,  1992.  It  is  the  original  standard  developed  by  the  MPEG 
committee,  and  is  sometimes  referred  to  as  simply  MPEG.  The  MPEG1  standard 
specifies  the  coded  representation  of  video  for  digital  storage  media,  and  describes 
the  decoding  process.  The  representation  supports  normal  speed  forward 
playback,  as  well  as  special  functions  such  as  random  access,  fast  play,  fast 
reverse  play,  normal  speed  reverse  playback,  pause  and  still  pictures.  It  is 
compatible  with  standard  525-  and  625-line  television  formats,  and  it  provides 
flexibility  for  use  with  personal  computers  and  workstation  displays. 

MPEG1  is  primarily  intended  for  application  to  digital  storage  media 
supporting  a  continuous  transfer  rate  up  to  about  1 .5  Mbps,  such  as  Compact  Disk 
(CD),  Digital  Audio  Tape  (DAT),  and  magnetic  hard  disks.  The  storage  media  may 
be  directly  connected  to  the  decoder,  or  via  communications  means  such  as 
busses,  LANs,  or  telecommunications  links.  MPEG1  is  intended  for  non-interlaced 
video  formats  having  approximately  288  (240  for  NTSC)  lines  of  352  pels  and 
picture  rates  around  24  Hz  to  30  Hz. 

While  MPEG1  was  built  to  a  large  degree  on  the  concepts  of  H.261,  because 
of  its  different  application  a  number  of  new  techniques  were  introduced. 

The  primary  difference  is  the  use  of  I,  P,  and  B  frames.  Rather  than  encode 
any  macroblock  in  Intra  or  Inter  mode,  MPEG1  designates  certain  frames  as  I 
frames,  to  be  coded  completely  in  the  Intra  mode.  This  makes  it  possible  to 
achieve  random  access  wherever  an  I  frame  is  used.  Other  frames  are  designated 
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as  P  frames,  and  are  coded  relative  to  a  previous  I  frame  or  a  previous  P  frame, 
primarily  using  Inter  coding  with  motion  compensation.  So  far,  this  result  could 
have  been  achieved  by  means  of  appropriate  use  of  the  tools  that  H. 261  provides. 
The  real  innovation  is  the  use  of  B  frames,  which  are  interpolated  between 
preceding  and  following  I  and  P  frames.  Interpolation  is  performed  by,  for  each 
macroblock,  predicting  pixels  relative  to  the  immediately  preceding  I  or  P  frame,  or 
the  immediately  following  I  or  P  frame,  or  both,  using  motion  vectors.  When  both 
are  used,  the  two  predictions  are  averaged.  Then  the  difference  between  the 
prediction  and  the  actual  frame  is  coded  using  DCT.  The  B  frame  itself  is  never 
used  to  predict  another  B  frame.  The  use  of  B  frames  provides  a  significant 
amount  of  additional  compression,  particularly  in  cases  where  a  moving  object 
uncovers  background,  which  cannot  be  predicted  from  a  previous  frame,  but  can 
be  from  a  subsequent  frame.  A  disadvantage  of  the  B  frame  is  that  it  involves  a 
considerable  coding  and  decoding  delay,  and  therefore  is  not  suitable  for 
interactive  applications. 

Another  innovation  is  the  use  of  motion  vectors  that  have  half-pixel 
resolution,  rather  than  the  full-pixel  resolution  used  in  H.261 .  Because  of  the 
Improved  quality  with  half-pixel  resolution,  the  use  of  loop  filtering  is  not  required. 

The  MPEG1  standard  has  been  technically  frozen  since  May  1991,  and  is 
about  to  be  formally  approved.  A  substantial  number  of  applications  are  already  on 
the  market. 

The  importance  of  MPEG1  for  HDTV  is  that  it  forms  the  basis  for  all  of  the 
proponent  digital  systems,  particularly  the  ADTV  system,  which  follows  it  very 
closely.  However,  all  of  the  proponents  use  motion-compensated  DCT  coding. 

3.2.2  MPEG2 

The  MPEG2  standard  is  derived  from  MPEG1 .  Its  primary  difference  is  that 
it  provides  higher  resolution  and  higher  quality  at  higher  bit  rates.  The  nominal 
resolution  is  CCIR  601,  which  for  NTSC  is  720  pixels  per  line  and  480  active  lines. 
The  standard  is  useful  over  a  wide  range  of  bit  rates,  but  during  its  development 
experiments  were  performed  at  4  and  9  Mbps.  At  9  Mbps,  the  quality  of  the 
decoded  Images  is  such  that  it  is  usually  very  difficult  to  distinguish  them  from  the 
original  digitized  images. 

In  addition  to  the  MPEG1  requirements,  there  are  additional  requirements  in 


3  -  8 


the  areas  of  compatibility  with  MPEG1  and  H.261,  error  and  cell-loss  resilience, 
extensibility,  scalable  decoding,  channel  hopping,  bit  stream  editing,  and  variable 
bit  rate  transmission. 

A  major  difficulty  with  achieving  higher  vertical  resolution  is  that  interlaced 
formats  must  be  dealt  with.  For  MPEG1  the  vertical  resolution  (240  lines)  was 
such  that  only  one  field  from  an  interlaced  frame  had  to  be  used.  For  MPEG2  (480 
lines)  both  fields  must  be  used.  Special  prediction  methods  are  required  to  make 
the  best  trade-off  between  using  the  pixel  that  is  ciosest  in  time  or  space  to  the 
pixel  to  be  predicted. 

It  should  be  noted  that  early  in  the  development  of  MPEG2,  an  invitation 
was  made  to  all  interested  parties  to  submit  proposed  coding  algorithms  for 
consideration.  Some  32  algorithms  were  submitted  for  evaluation.  They  included 
subband  coding,  wavelets,  and  DCT.  Each  proponent  simulated  his  algorithm  on 
standard  video  sequences.  The  resulting  decoded  sequences  were  rigorously 
evaluated  by  juries.  The  result  of  this  evaluation  was  that  the  MPEG1  algorithm 
performed  about  as  good  as  any  other  submitted.  This  was  indeed  a  fortunate 
result  for  the  cause  of  compatibility. 

MPEG2  is  currently  under  development.  The  schedule  calls  for  the  technical 
parts  of  the  standard  to  be  frozen  in  March  1993,  with  several  years  additional 
required  to  resolve  editorial  issues  and  go  through  the  approval  process. 

The  importance  of  MPEG2  to  HDTV  is  that  some  of  the  proponents  seem  to 
be  trying  to  bring  their  systems  into  alignment  with  MPEG2,  either  by  modifying 
their  systems  to  agree  with  MPEG2,  or  by  trying  to  get  MPEG2  to  adopt  their 
proprietary  algorithms.  An  example  of  the  latter  is  AT&T's  proposing  that  the 
Vector  Quantization  approach  that  they  use  in  their  HDTV  approach  also  be 
adopted  by  MPEG2. 

3.3  COMPARISON  OF  DIGITAL  TV  STANDARDS 

Figure  3.6  is  a  Venn  diagram  that  shows  the  elements  of  the  various 
standards,  and  indicates  which  of  them  are  common  to  two  or  more.  The 
proposed  ADTV  (MPEG+  +)  is  used  to  represent  HDTV,  since  it  has  more  in 
common  with  the  standards  than  any  of  the  others.  It  should  be  noted  that  this  is 
a  rough  indication  only,  since  many  of  the  parameters  are  variable,  but  are  shown 
in  their  nominal  settings.  For  example,  all  but  H.26X  are  shown  as  CBR,  even 
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though  they  can  and  do  operate  with  changing  rates,  as  can  occur  when  other 
data,  such  as  audio,  is  switched  on  or  off.  And  of  course  H.26X  can  operate  at 
CBR,  if  that  is  appropriate.  Also  caution  should  be  used  for  MPEG2,  since  it  has 
not  yet  been  finalized. 

From  Figure  3.6,  it  can  be  seen  that  the  major  differences  in  the  standards 
are  at  high  levels,  such  as  modulation,  multiplexing,  FEC,  and  picture  format.  The 
basic  low-level  video  processing  is  remarkably  constant  from  one  standard  to 
another.  The  consequence  of  this  is  that  processing  "engines"  can  be  built  in 
silicon  that  can  work  with  any  of  the  standards,  and  even  with  others  such  as 
JPEG.  This  will  result  in  great  economies  of  scale  and  competition  that  will  make 
feasible  economical  implementations  of  all  the  standards.  Even  more  important,  it 
means  that  equipment  can  be  built  that  can  work  with  all  the  standards,  including 
HDTV,  by  relatively  easy  software  modifications  at  the  higher  levels. 
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.  .  720  X  480 

MPE61  . j .  (CCIR  601) 

4-9  Mbps 

1  9/8 


KEY 


BCH 

Bose-Chaudhuri-Hocquenghem 

CBR 

Constant  Bit  Rate 

CCIR  601 

Standard  for  Digital  TV 

GIF 

Common  Intermediate  Format 

DCT 

Discrete  Cosine  Transform 

FEC 

Forward  Error  Correction 

GOB 

Group  of  Blocks 

Mbps 

Mega  bits  per  second 

MUX 

Multiplex 

QAM 

Quadrature  Amplitude  Modulation 

QCIF 

Quarter  CIF 

RS 

Reed-Solomon 

VBR 

Variable  Bit  Rate 

VLC 

Variable  Length  Coding 

n  X  m 

Luminance  format  (pixels  x  lines) 

p:q 

Picture  aspect  ratio  (width: height) 

r/s 

Pixel  aspect  ratio  (height/width) 
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4.0  HDTV  APPLICATIONS  IN  THE  FEDERAL  GOVERNMENT 


It  is  generally  acknowledged  that  the  use  of  television  throughout  the 
Federal  government  is  pervasive  and  most  critical  to  its  successful  operation. 

When  the  domestic  TV  standard  was  upgraded  from  monochrome  to  color,  there 
was  a  major  breakthrough  for  the  Federal  government.  It  is  anticipated  that  the 
conversion  from  NTSC  to  HDTV  will  be  at  least  as  revolutionary.  The  higher 
resolution  will  provide  a  far  more  effective  link  between  Federal  employees  and  the 
vast  quantities  of  information  which  is  generated  throughout  the  Federal 
government  on  an  ever-expanding  scale.  Key  application  areas  include  multimedia 
desktop  workstations,  training/distance  learning,  simulation,  command/control, 
reconnaissance/surveillance,  and  video  teleconferencing.  These  applications  are 
discussed  below. 

Multimedia  Desktop  Workstations 

With  the  advent  of  HDTV,  the  cost  of  high  resolution  computer  displays  on 
the  desktop  will  drop  radically.  This  will  permit  the  cost  effective  merging  of 
several  media  (facsimile,  videophone,  teleconferencing,  computer  I/O)  at  the 
desktop.  Multimedia  services  involve  the  integrated  delivery  and  control  of 
multiple  information  types  within  a  single  service,  e.g.,  video,  audio,  imagery,  and 
test.  Multimedia  communication  will  become  increasingly  important  as  multimedia 
applications  become  more  prominent  in  workstation  and  personal  communications 
environments.  The  basic  framework  for  a  desktop  multimedia  workstation  is  a 
display  having  high  resolution  comparable  to  the  HDTV  formats.  There  is  therefore 
a  strong  linkage  between  the  multimedia  workstation  environment  and  HDTV. 

Many  of  the  functions  previously  associated  with  the  multiplexing  of 
components  of  a  multimedia  connection,  and  embodied  in  end-to-end  signalling  and 
framing  systems,  such  as  Recommendation  H.221,  are  provided  by  basic  ATM 
functionality.  Therefore  the  cost-effectivity  of  multimedia  desktop  workstations 
will  increase  sharply  with  the  advent  of  HDTV  and  broadband  ISDN. 

Training  &  Distance  Learning 

Television  plays  a  major  role  in  the  training  of  all  Federal  employees,  but  is 
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particularly  important  in  the  DoD.  It  is  claimed  by  some  that  the  Desert  Storm 
success  was  due  largely  to  the  excellent  training  throughout  the  DoD. 

In  many  cases,  the  quality  of  the  training  mission  is  impaired  by  the  low 
resolution  of  NTSC.  For  example,  in  many  cases,  it  is  desired  to  present  detailed  , 
graphics  and  diagrams  to  observers.  HDTV  will  significantly  enhance  this 
capability.  In  the  training  application,  the  development  of  HDTV  storage/retrieval 
equipment  is  as  important  as  the  cameras  and  displays. 

Using  the  HDTV  aspect  ratio  of  16:9,  it  becomes  easy  to  include  subtitles, 
data,  and  other  Information  as  explanatory  material  on  either  the  right  or  left  side 
of  the  screen,  or  to  divide  the  picture  into  two  halves  and  display  them  side  by 
side. 

In  general,  the  quality  of  the  training  increases  as  the  student  has  the 
impression  of  being  "present"  with  the  teacher.  HDTV  creates  a  greater  sense  of 
presence  than  NTSC. 

Simulation 

Simulation  activity  has  been  rapidly  expanding  in  the  DoD  in  recent  years 
because,  for  example,  it  is  far  less  expensive  to  "simulate"  cockpit  imagery  than  to 
fly  airplanes.  But  the  quality  of  the  simulators  has  been  restricted  by  the  resolution 
of  TV  displays  having  reasonable  cost.  HDTV  will  radically  improve  the 
performance  and  cost-efficiency  of  simulation  work  in  the  DoD  and  other  Federal 
agencies. 

Command  and  Control 


Television  plays  a  critical  role  in  the  performance  of  the  command  and 
control  function  throughout  the  DoD.  The  use  of  TV  for  battlefield  status  displays 
is  absolutely  critical,  and  there  is  no  question  that  HDTV  will  provide  a  quantum 
leap  in  the  ability  to  access,  process,  and  most  importantly,  display  a  large  quantity 
of  command/control  information  in  an  effective,  practical  and  inexpensive  manner. 
HDTV  will  impact  the  DoD  command/control  function  at  the  desktop  as  well  as 
large  command  centers. 

A  typical  command  and  control  function  involves  the  transmission  and 
display  (desk  monitor  or  large  display  board)  of  a  map  with  superimposed  text  and 
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vector  lines.  The  HDTV  format  is  obviously  ideal  for  this  application.  It  is  useful  to 
note  that  In  many  situations  only  a  small  part  of  the  image  changes  at  one  time.  In 
such  a  case,  the  DCT/Motion  Compensation  coding  technique  could  readily 
transmit  the  HDTV  signal  using  existing  telecommunications  or  ISDN  channels. 

Assume  an  HDTV  frame  requires  10®  bits  for  definition  (0.5  bits/pixel).  If 
the  channel  rate  were  384  Kbps,  an  image  can  be  completely  updated  in 
approximately  3  seconds.  If  only  a  small  part  of  the  image  changes,  that  change 
can  be  accomplished  virtually  instantaneously. 

Reconnaissance/Surveillance 


The  DoD  is  capable  of  acquiring  vast  quantities  of  reconnaissance  and 
surveillance  information  from  satellites,  aircraft,  and  ground  based  platforms.  In 
many  cases,  where  it  is  difficult  to  store  and  display  this  information  cost 
effectively  to  commanders  and  analysts,  HDTV  will  significantly  help  address  this 
problem. 

Video  Teleconferencing 

The  purpose  of  video  teleconferencing  (VTC)  is  to  provide  a  sense  of  visual 
presence  between  groups  of  people  in  remote  locations.  One  of  the  disadvantages 
of  existing  VTC  systems  is  that  this  sense  of  presence  is  limited  in  two  ways. 

First,  if  there  is  a  large  group  of  people  in  a  conference  room,  the  image  of  one 
person  usually  occupies  a  small  fraction  of  the  screen  and  therefore  is  not  seen 
clearly.  The  use  of  HDTV  in  such  a  situation  would  greatly  improve  the  ability  to 
sense  the  facial  expressions  and  body  language  of  persons  in  the  remote  location. 

The  second  problem  with  low  TV  resolution  in  existing  systems  is  the 
inability  to  reproduce  high  resolution  graphic  scenes  with  the  required  clarity.  At 
the  present  time,  an  8 14  x  11  document  is  typically  reproduced  by  the  video 
camera  in  two  steps  --  top  half  first,  then  the  bottom  half.  An  HDTV  system 
would  permit  the  entire  document  to  be  viewed  on  one  screen  which  would  be  an 
important  improvement. 
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5.0  CONCLUSIONS 


The  following  conclusions  are  drawn  from  the  work  performed  on  this 
project. 


•  There  is  a  general  trend  toward  the  adoption  of  a  domestic  standard 
for  HDTV  transmission  based  upon  all-digital  technology.  At  the 
present  time,  there  are  four  proponents  of  all  digital  systems  as  listed 
below. 


PROPONENT  TEAM 

SYSTEM 

CHROMA 

PIXELS 

AT&T,  ZENITH 

SPECTRUM  COMPATIBLE 

360  X  640 

GENERAL  INSTRUMENT,  MIT 

DIGICIPHER 

480  X  352 

SARNOFF,  NBC,  PHILIPS, 

ADVANCED  COMPATIBLE  TV 

480  X  720 

THOMSON 

GENERAL  INSTRUMENT,  MIT 

ATV  PROGRESSIVE 

360  X  640 

All  four  proposed  systems  employ  DCT  coding  (8x8  pixels)  and 
motion  compensation,  which  is  similar  to  the  coding  technique 
employed  in  CCITT  Recommendations  H.261,  H.26x,  MPEG-1,  and 
MPEG-2.  This  trend  toward  all-digital  transmission  and  alignment  with 
existing  standards  is  obviously  a  very  favorable  development  for  the 
federal  community  interested  in  telecommunications. 

•  There  are  two  standardization  activities  proceeding  in  parallel  which 
are  highly  inter-related;  (1)  the  U.S.  HDTV  standard  developed  by  the 
FCC,  (2)  the  international  MPEG-2  standard  by  ISO.  The  two 
organizations  developing  these  standards  are  attempting  to  achieve  as 
much  compatibility  as  possible.  It  is  planned  to  freeze  the 
specifications  for  both  standards  in  1993. 

•  On  the  international  front,  the  Japanese  are  providing  the  MUSE 
system  (1125  lines;  60  fields/sec.)  on  an  operational  tariffed  basis. 
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while  the  Europeans  are  experimenting  with  the  MAC  system  (1250 
lines;  50  fields/sec.).  While  these  systems  are  both  fundamentally 
analog  in  nature,  there  is  considerable  concern  in  these  foreign 
countries  that  the  U.S.  all-digital  standard  will  supersede  their 
approaches.  For  this  reason,  they  are  supporting  work  by  the  CCIR  to 
develop  an  international  all-digital  HDTV  standard. 

•  It  is  anticipated  that  the  HDTV  system  will  have  a  major  impact  within 
the  Government  community.  Examples  of  important  applications  are 
multimedia  desktop  workstations,  training/distance  learning, 
simulation,  command/control,  reconnaissance/surveillance,  and  video 
teleconferencing. 
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6.0  RECOMMENDATIONS 


Based  upon  the  work  performed  on  this  project,  the  following 
recommendations  are  proposed. 

•  To  maximize  interoperability  between  future  HDTV  systems  and 
Federal  telecommunication  systems  (e.g.,  video  teleconferencing),  it  is 
recommended  that  the  domestic  HDTV  standardization  process  and 
the  international  MPEG-2  standardization  efforts  be  aligned  as  closely 
as  possible.  To  accomplish  this  alignment,  it  is  recommended  that 
Delta  continue  its  active  participation  in  both  of  these  organizations. 
To  insure  that  the  H.26x  Broadband  ISDN  video  coding  standard  is 
closely  aligned  with  these  standards  (HDTV,  MPEG-2),  it  is  further 
recommended  that  Delta  continue  its  activity  with  the  ATM  Video 
Coding  Experts  Group.  For  example,  it  is  important  that  the  MPEG-2 
standard  have  the  potentiality  for  low  delay  and  high  error  resilience 
to  permit  an  easy  extension  to  H.26x. 

•  To  maximize  the  interoperability  and  applicability  of  the  future  HDTV 
system  throughout  the  Federal  government,  it  is  recommended  that 
the  selected  system  be  compatible  with  progressive  scan  and  that 
packetized  transmission  be  employed.  This  will  maximize 
compatibility  with  computerized  systems  and  future  ATM  networks. 
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APPENDIX  A 


INTEROPERABILITY  ASSESSMENTS 


Reference  1 


INTEROPERABILITY  ASSESSMENTS 


December  11, 1992 


StellaCom,  Inc. 

Advanced  Television  Systems 
1901  N.  Moore  Street  •  Suite  601 
Rosslyn,  VA  22209-1706 


(703)  243-1695  (p) 
(703)  243-1698  (f) 


This  work  was  done  for  ACATS  PS/WP-4. 


INTRODUCTION 


This  document  was  written  by  SteliaCom,  Inc.  for  PS-WP/4,  a  working  party  of  the 
Advisory  Committee  on  Advanced  Television  Systems  (ACATS)  which  operates  on  behalf  of  the 
Federal  Communications  Commission.  The  information  contained  in  this  document  was 
obtained  from  documents  submitted  by  the  proponents  to  PS-WP/4  and  to  other  working  parties 
of  ACATS,  and  from  oral  presentations  by  the  proponents  In  September,  1992  in  which 
questions  from  PS-WP/4  were  answered. 

It  was  the  intention  of  PS-WP/4  for  this  document  to  specify  whether  each  feature 
discussed  was 

1 )  in  the  system  tested  at  ATTC, 

2)  promised  for  the  system  to  be  field  tested, 

3)  promised  for  the  system  after  field  testing,  or 

4)  possible/realistic  at  a  future  time. 

This  was  done  to  the  extent  possible  with  the  information  available.  However,  the  proponents 
are  continuing  to  enhance  their  systems  and  revise  their  time  estimates  for  the  implementation 
of  features.  Some  of  the  enhancements  under  way  now  are  in  response  to  the  concerns  of  PS- 
WP/4  on  Interoperability,  Scope  of  Services  and  Features,  and  Extensibility. 


Financial  support  was  provided  1^  /^ple  Computer,  Inc.,  Bellcore,  Cable 
Television  Laboratories,  Inc.,  Eastman  Kodak  Company,  Digital  Equipment 
Corporation,  and  Viacom  International,  Inc. 
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Advanced  Digital  HDTV  InteroDerabllftv  Assessment 


System  Description 

The  Advanced  Digital  HDTV  (AD-HDTV)  System  from  the  Advanced  Television  Research 
Consortium  is  an  all-digital  system  with  1050  scan  lines  per  frame  and  29.97  frames  per  second. 
Compression  and  transmission  are  in  a  29.97  hz  progressive  form.  The  system  tested  used  a  2:1  < 

interlaced  source  and  a  2:1  interlaced  display.  The  display  format  of  the  prototype  system  has  a  16:9 
image  aspect  ratio  with  960  lines  per  frame  and  1248  pixels  per  line,  although  future  versions  are 
expected  to  use  1440  pixels  per  line.  The  proponent  also  claims  that  the  system  can  be  changed  to 
have  square  pixels.  Features  of  the  system  include  a  prioritized  data  transport  format  that  separately 
packages  video,  audio  and  data  and  allows  their  mix  to  vary  dynamically.  It  also  includes  two-tier 
prioritization  where  each  data  segment  is  assigned  to  a  high-priority  (HP)  data  stream  or  a  standard 
priority  (SP)  data  stream.  Transmission  is  by  Spectrally  Shaped  QAM  (SS-QAM)  consisting  of  two  4-  or 
5-bit-per-symbol  QAM  signals  fitted  between  co-channel  NTSC  carriers.  Each  data  stream  modulates  a 
16-  or  32-QAM  carrier,  with  the  high-priority  carrier  transmitted  at  higher  power.  This  allows  a  degree  of 
graceful  degradation  on  the  edge  of  the  broadcast  contour  and  transmission  robustness  within  the 
service  area,  while  minimizing  interference  to  and  from  co-channel  NTSC.  It  also  allows  a  degree  of  bit- 
stream  scalability. 


I.  Interoperability 

A.  with  Cable  TV 

The  tests  of  the  proponent  systems  at  ATTC  by  Cable  Labs  should  determine  the  critical 
aspects  of  interoperability  with  cable  TV.  Other  factors  that  may  aid  cable  TV  Interoperability  include 
channel  augmentation  to  9  or  12  MHz  for  improved  picture,  or  better,  going  to  a  higher  data  rate  that 
can  be  supported  in  the  6  MHz  channel  by  the  relatively  benign  cable  environment.  The  question 
remains  whether  HDTV  receivers  can  be  built  to  support  higher  numbers  of  bits  per  symbol  without  cost 
impact. 

Encryption  and  addressing  are  important  service  features  for  Cable  TV.  See  Section  11,  items  F 

and  G. 


B.  with  digital  technology 

Since  this  system  is  all-digital,  the  advantages  of  all-digital  systems  apply. 

C.  with  headers/descriptors 

The  AD-HDTV  system  formats  all  its  data  in  a  layered  manner  following  the  MPEG  syntax.  After 
synchronization,  the  data  link  layer  identifies  the  service  type.  Within  the  adaptation  level,  the 
adaptation  header  contains  information  governing  the  packing  of  variable-length  code  words  for  video, 
and  information  used  in  recovery  after  channel  changes  or  errors'.  The  video  service  level  includes  the 
actual  encoded  video  information. 

All  data  sent  by  the  AD-HDTV  system  is  grouped  into  fixed-length  cells  that  contain  data  of  a 
single  particular  type.  The  cells  are  148  bytes  long,  starting  with  a  single  byte  for  synchronization,  an  8- 
blt  Barker  sequence  (0001 1101).  This  is  followed  by  a  single  byte  for  a  service  header  that  defines  cell 
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types  at  the  link  layer.  Next  Is  a  4-byte  adaptation  level  header  that  contains  pointers  to  the  start  of  a 
slice  within  the  picture  frame,  as  well  as  the  identity  of  the  slice  within  the  frame.  These  are  followed  by 
a  120-byte  information  payload  containing  actual  data  for  the  cell’s  particular  service  type.  At  the  end 
of  the  cell  is  a  2-byte  frame  check  sequence  (FCS)  trailer,  and  a  20-byte  forward-error-correction  (FEC) 
trailer. 

The  adaptation  layer  pointers  and  slice  identification  information  provide  re-entry  points  within 
the  codec  video  data,  making  it  possible  to  begin  decoding  at  a  known  point  after  an  error  event  or 
channel  change  that  requires  a  restart  for  some  or  all  of  the  video  decoding.  < 

D.  with  NTSC 

The  proponent  selected  the  field  rate  of  59.94  Hz  for  compatibility  with  NTSC.  The  number  of 
active  video  lines,  960,  was  selected  to  be  double  the  number  of  active  NTSC  lines,  480.  (NTSC 
normally  has  483  active  video  lines,  but  483  is  an  awkward  number  for  conversions.)  Since  the  aspect 
ratio  of  this  system,  16:9,  is  different  from  the  aspect  ratio  of  NTSC,  4:3,  a  choice  must  be  made  for  the 
conversion.  Two  choices  for  down-conversion  are  1)  "Edge  Crop",  In  which  the  HDTV  picture  fills  480 
lines  on  NTSC  with  loss  of  the  sides  of  the  HDTV  picture,  and  2)  "Letterbox",  in  which  the  full  width  of 
the  HDTV  picture  is  displayed  in  the  full  width  of  the  NTSC  picture,  but  leaving  unused  areas  at  the  top 
and  the  bottom  of  the  NTSC  screen.  Pixel  values  from  HDTV  lines  are  stored  In  memory  and  read  out 
at  reduced  speed  to  make  NTSC  lines.  The  range  of  pixels  read  and  the  clocking  rate  depend  upon 
the  choice  of  edge-crop  or  letterbox.  The  down-conversion  involves  interpolation  between  HDTV  pixels 
in  a  line  and  between  HDTV  lines. 

E.  with  film 

it  is  claimed  that  AD-HDTV  will  support  an  "electronic  film"  format  that  makes  use  of  the 
redundant  field  to  improve  quality.  Since  film  has  a  slower  (24  frames  per  second)  temporal  rate,  AD- 
HDTV  will  scan  its  1 050-line  raster  progressively  for  film  at  24  frames  per  second,  with  the  same  1 440  x 
960  pixel  format  that  it  uses  with  video  sources.  However,  in  film  productions  where  computer  graphics 
are  used  extensively,  square  pixels  may  be  more  desirable  in  the  image  representation.  It  is  claimed 
that  AD-HDTV  will  also  provide  a  progressively  scanned  1440  x  810  square  pixel  format  to 
accommodate  this  source.  Most  receivers  will  perform  3:2  pull-down  to  convert  to  their  59.92  Hz  field 
rate,  but  higher-cost  receivers  could  use  3:1  frame  repeat  to  display  at  72  Hz. 

F.  with  computers 

Progressive  scanning  and  square  jxxels  are  Important  factors  for  interoperability  of  an  HDTV 
system  with  computers  -  nearly  all  existing  bit-mapped  computer  graphics  displays  have  these 
features.  Progressive  scanning  and  square  pixels  are  most  critical  for  re^-time  applications  such  as 
display,  scan-conversion,  frame  capture,  and  video  effects.  Computers  are  expected  to  play  an 
increasing  role  in  video  image  generation  and  production  and  it  is  desirable  to  have  an  HDTV  format 
which  allows  direct  display  and  manipulation  of  HDTV  video  on  the  computer.  Progressive  scanning  is 
preferable  for  computer  applications  to  avoid  artifacts  that  are  common  with  interlaced  display  of 
computer  generated  imagery.  Square  pixels  are  inherent  to  graphics  display  hardware  for  all  popular 
computers  employing  bit-mapped  displays.  The  main  reason  is  that  it  facilitates  processing  of  2D 
transformations,  especially  rotations. 

Encoding  and  transmission  in  AD-HDTV  are  done  in  progressive  form,  favoring  interoperability 
with  computers,  although  testing  of  this  system  has  been  done  with  interlaced  sources  and  displays 
requiring  format  conversions.  ATRC  has  suggested  that  the  system  will  eventually  use  progressive 
sources  and  displays.  With  1440  pixels/line,  the  pixels  in  AD-HDTV  are  18.5%  wider  than  high,  and 
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with  1248  pixels/line  they  are  37%  wider  than  high.  ATRC  has  suggested  that  their  system  can  provide 
square  pixels  by  reduction  of  the  number  of  active  scan  lines  to  810. 

This  system  also  offers  the  possibility  of  interoperability  with  MPEG-based  computer 
applications.  See  Section  J.3  below. 

G.  with  satellites 

Digital  signals  with  bit  rates  used  by  the  proponents  for  terrestrial  transmission  can  generally  be 
accommodated  in  the  available  bandwidth  of  a  satellite  transponder.  The  total  data  rate  for  the  AD- 
HDTV  system  in  the  mode  tested  by  ATTC  is  24.0  Mbps.  The  proponent  has  suggested  removing  the 
0.9-rate  trellis  code  used  with  SS-QAM,  making  the  net  data  rate  21.6  Mbps.  The  proponent  does  not 
anticipate  the  need  for  any  additional  error  correction  for  satellite  transmission,  although  convolutional 
coding  is  normally  used.  The  proponent  stated  that  three  AD-HDTV  channels  may  be  carried  in  a 
transponder.  However,  it  is  unlikely  that  more  than  two  AD-HDTV  channels  will  be  carried  in  a  typical 
36-MHz  transponder.  See  the  link  analysis  done  for  the  DSC-HDTV  signal.  The  proponents  also  point 
out  that  it  is  possible  to  carry  AD-HDTV  and  NTSC  signals  on  the  same  transponder. 

H.  with  packet  networks 

The  data  link  sublayer  format  is  based  on  a  "cell  relay"  asynchronous  time-division  multiplexing 
concept  similar  to  the  eisynchronous  transfer  mode  (ATM)  standard  that  will  be  used  for  broadband 
integrated  services  digital  network  (B-ISDN).  The  data-link  header  contains  information  such  as  priority 
indicator,  service  ID  and  cell  sequence  number.  This  provides  service-independent  transport  services 
such  as  priority  support,  service  multiplexing,  and  cell-error  detection  and  comection.  For  the  received 
bit-stream,  the  transport  decoder  performs  Reed-Solomon  decoding  and  a  cyclic  redundancy  check 
(CRC)  for  error  detection.  Cells  received  in  error  after  correction  are  discarded  by  the  demultiplexer. 
Each  cell  contains  148  bytes,  « 

The  proponent  claims  that  minor  picture  artifacts  may  be  noticed  at  a  packet  error  rate  of  10"'’. 
However,  AD-HDTV  will  make  viewable  pictures  and  perfect  sound  even  if  only  its  High  Priority  cells  are 
received.  This  is  effectively  an  error  rate  of  8  out  of  10  packets  lost  Out-of-order  packets  can  be  put 
back  in  order  at  the  receiver. 

i.  with  interactive  systems 

The  total  rate  control  buffer  delay  is  nominally  250  msec  for  both  the  high  priority  and  standard 
priority  buffers.  In  addition,  there  may  be  image  build-up  after  a  switch  or  a  scene  cut  depending  on  the 
place  in  the  9-frame  Group-of-Pictures  (GOP)  cycle.  Total  acquisition  time  has  been  designed  to  be 
limited  to  a  worst  case  value  of  about  1  sec.  This  number  is  based  on  0.5  sec  for  the  modem  and 
equalizer  and  0.5  sec  for  the  video/transport  acquisition. 

Latency  is  the  time  delay  between  a  video  frame  going  Into  the  encoder  and  the  oorresponding 
frame  coming  out  of  the  decoder  in  the  back-to-back  mode.  It  can  be  important  in  interactive 
applications.  According  to  the  proponent,  a  full-quality  AD-HDTV  encoder  requires  4  frames  of  latency, 
due  to  its  use  of  bi-directional  motion  compensation  with  an  MPEG  GOP  structure  of  M=3,  N=9.  An 
additional  frame  is  needed  for  interlaced-to-progressive  conversion.  Similar  delays  are  present  at  the 
receiver.  Rate  buffers  at  the  transmitter  and  at  the  receiver  cause  additional  delays.  Furthermore,  the 
system  implemented  for  test  may  not  be  representative  of  the  minimum  delay  possible.  Comparison  of 
latency  between  the  proponent  systems  is  difficult  at  this  time. 

This  proponent  claims  that  for  Interactive  applications  where  latency  is  a  concern,  an  encoder 
can  produce  an  MPEG  bit  stream  using  only  forward  motion  compensation  to  reduce  the  coding  part  of 
the  latency. 
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J.  Format  conversion 

1.  with.l325/6Q 

Up-converting  to  the  Common  Image  Format  (1920  x  1080)  requires  a  8:9  vertical  interpolation 
and  a  3:4  horizontal  interpolation  from  1440  x  960.  SMPTE  240M  currently  uses  1035  active  lines. 
Colorimetry  used  by  AD-HDTV  is  intended  to  be  consistent  with  SMPTE  240M. 

2.  With  125Q/5Q 

Conversion  between  systems  with  different  frame  rates  is  the  most  difficult  type  of  conversion 
presently  being  done.  Digital  conversion  between  59.94  fields  per  second  and  50  fields  per  second 
requires  a  number  of  frame  stores  and  very  large  processing  capability.  Simple  methods  that  involve 
frame  dropping  lead  to  jerky  motion,  but  other  techniques  produce  acceptable  images  under  most 
conditions.  This  difficult  conversion  is  not  simplified  by  the  fact  that  both  the  source  system  and  the 
target  system  are  interlaced  2:1 . 

3.  with  MPEG 

AD-HDTV's  use  of  MPEG-based  video  and  audio  compression  provides  the  possibility  of 
interoperability  with  MPEG-based  computer  multimedia  applications  directly  In  the  compressed  bit- 
stream  format.  The  underlying  video  compression  algorithm  adheres  to  the  MPEG1  standard  from  ISO, 
in  that  parameters  allowable  within  the  MPEG1  definition  are  used  although  they  are  not  the  MPEG1 
default  parameters.  Prior  to  entering  the  prioritization  and  transport  processors,  the  compressed  video 
representation  produced  at  the  encoder  fully  conforms  to  the  MPEG1  specification.  An  MPEG  bit- 
stream  can  be  obtained  from  the  output  of  the  compression  encoder  at  the  interface  to  the  priority 
processor.  Note  that  the  AD-HDTV  encoder  prototype  uses  an  Internal  fixed-length  representation  for 
MPEG  code  words  at  the  interface  between  its  compression  and  prioritization  stages,  so  that  a 
standard  MPEG1  bit  stream  Is  not  currently  available  as  an  output  from  the  prototype  hardware. 

4.  with  still  image 

AD-HDTV's  MPEG1  compression,  based  on  the  DOT,  is  generally  compatible  with  JPEG,  the 
ISO  standard  for  still-image  compression. 

Photo  CD  uses  a  proprietary  subband  coding  approach  for  compression.  Picture  interchange 
with  AD-HDTV  requires  full  decompression  and  recompression. 

GDI  uses  the  MPEG1  compression  syntax  for  motion  video.  Therefore,  AD-HDTV  will  be  directly 
compatible  with  GDI. 

K.  Scalability 

Easy  scalability  Is  claimed  by  the  proponent.  The  MPEG1  bit-stream  produced  by  the  encoder 
is  organized  into  several  layers.  The  ISO-MPEG  standard  not  only  specifies  the  form  of  the 
compressed  video  bit-stream,  but  it  incorporates  several  higher  layers  for  specifying  video  parameters. 
In  particular,  the  "sequence  header"  provides  a  mechanism  for  dynamic  specification  of  the  picture  size 
(In  terms  of  pixels  x  lines),  the  pixel  aspect  ratio  and  the  frame  hate.  With  this  mechanism,  AD-HDTV 
supports  dynamic  reconfiguration  among  its  nominal  and  optional  spatial  resolutions  and  raster  formats 
(e.g.,  interlaced  HDTV  and  progressively  scanned  film  modes).  These  capabilities  are  not  implemented 
In  the  AD-HDTV  prototype  hardware  tested  at  the  ATTG. 

The  picture  produced  by  AD-HDTV's  HP  signal  alone  is  a  reduced-quality  image.  The  proponent 
has  announced  his  intention  to  reprioritize  the  data  partition  to  make  the  HP-only  picture  useful  as  a 
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digital  hierarchy.  The  present  form  is  considered  by  the  proponent  as  sufficient  to  cany  the  viewer 
through  short  SP  channel  outages,  and  as  a  base  for  error  concealment  for  high  SP  error  rates.  The 
decoded  artifacts  observed  in  an  HP-only  reconstruction  will  depend  on  the  exact  priority  processing 
algorithm.  Typical  priority  processor  operation  results  in  a  lower  resolution  and  slightly  jerky  motion 
rendition  since  the  B-frame  data  is  carried  primarily  in  the  SP  channel. 

For  picture-in-picture  and  picture-out-of-picture,  only  the  reprioritized  HP  signal  with  its  4.8  Mbps 
data  steam  needs  to  be  processed,  with  savings  in  hardware  over  full  AD-HDTV  processing. 

For  multiple  programs  In  a  single  channel,  AD-HDTV*s  Prioritized  Data  Transport  layer  provides 
for  asynchronous  delivery  of  multiple  service  types.  Multiple  video  streams  can  be  assigned  Individual 
service  types. 


II.  Scope  of  services  and  features 

A.  Initial  Use  of  Ancillary  Data 

Because  of  AD-HDTV’s  asynchronous  data  multiplexing,  there  is  no  hard  partitioning  of  ancillary 
data  needed.  Unassigned  service  types  provide  for  the  delivery  of  many  types  of  ancillary  data. 

B.  Audio 

AD-HDTV  offers  flexibility  In  the  mix  of  audio,  video  and  data  and  therefore  the  number  of  audio 
channels.  The  audio  system  chosen  for  AD-HDTV  uses  a  coding  algorithm  called  MUSICAM,  which  is 
related  to  standards  endorsed  by  the  ISO  in  conjunction  with  MPEG.  The  audio  coding  accepts  16-bit 
samples  of  audio  at  a  rate  of  48  k-samples/second,  for  each  channel  of  the  stereo  pair.  The  MUSICAM 
algorithm  compresses  the  1 .54  Mbps  to  256  kbps.  AD-HDTV  provides  an  additional  5  dB  of  robustness 
for  audio,  as  it  transmits  audio  as  High  Priority  data.  Thus,  audio  remains  when  only  the  HP  picture  can 
be  decoded. 

C.  Data 

An  allocation  of  256  kbps  has  been  made  for  auxiliary  data,  and  that  data  has  been  set  ^ide 
within  the  prototype  equipment  as  SP  data.  A  standard  communications  interface  port  for  this  auxiliary 
data  was  provided  for  the  prototype. 

Auxiliary  data  is  normally  sent  in  standard  priority  (SP)  mode.  The  actual  priority  used  in  an 
operational  system  is  a  broadcaster  option  and  will  depend  on  the  type  of  data  service.  If  <100  kbps  of 
HP  capacity  were  used  for  data,  the  proponent  claims  that  the  impact  on  HP-only  picture  quality  would 
be  small;  for  higher  auxiliary  data  rates,  use  of  SP  is  recommended,  since  additional  reliability  in  data 
services  can  be  provided  via  appropriate  retransmission  schedules,  etc.  Also,  see  G.  Addressing. 

D.  Text 

Text  delivery  is  discussed  below  in  Section  I  on  Overlay. 

E.  Captioning 

It  was  suggested  that  9.6  kbps  of  the  ancillary  data  capacity  be  used  for  closed  captioning. 
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F.  Encryption 

The  prototype  hardware  did  not  include  encryption,  but  AD-HDTV's  transport  structure  has  been 
designed  to  readily  accommodate  encryption.  The  proponent  expects  to  detail  the  encryption  method 
with  industry  participation. 

G.  Addressing 

The  system  provides  opportunities  for  high-data-rate  burst-mode  delivery  of  auxiliary  data.  As  a 
limit,  the  entire  channel  capacity,  18.5  Mbps,  could  be  dedicated  to  addressing  a  large  number  of 
receivers  with  decryption  keys  during  the  few  seconds  preceding  a  popular  pay^r-viw  program.  A 
still  picture  with  audio  using  4  Mbps  could  be  transmitted  while  14.5  f^ps  Is  devoted  to  subscriber 
addressing.  Conditional  access  data  can  be  treated  as  a  special  service  type  into  its 

own  transport  cells,  or  included  within  the  video  and  audio  data.  It  can  be  decoded  without  decoding 

video  and  audio. 


H.  Low-cost  receiver 

AD-HDTV  Is  partitioned  into  High  Priority  and  Standard  Priority  data  strearns  that  rnodulate 
separate  carriers.  A  low-cost  receiver  can  be  built  to  use  only  the  HP  stream  with  savings  on  the  tuner 
and  processing  components.  The  HP-only  mode  has  been  demonstrated,  showing  high  resolution  but 
poor  response  to  motion.  The  proponent  anticipates  improvements  to  the  prioritization  algorithms  to 
make  HP-only  pictures  that  are  much  better  suited  for  small,  low-cost  receivers. 


I.  VCR  capability 

No  hardware  development  of  VCR's  has  been  reported.  However,  it  is  thought  that  digital 

recording  is  within  current  technology  for  consumer  use.  _ 

MPEG  has  periodically  occurring  frames  that  are  entirely  spatially  coded.  This  is  said  to  provide 
the  ability  to  reconstruct  pictures  in  fast-fonvard  and  reverse  scanning  modes  from  digital  storage 
mcciis 

MPEG  compression  has  periodic  spatially-coded  frames,  allowing  splices  and  inserts  to  be 
made  on  GOP  boundaries  in  the  compressed  transmission  format  Limited  picture  cropping  can  be 

handled  in  compressed  form  If  it  aligns  with  macroblock  boundaries.  ^  „ 

AD-HDTV  has  provision  for  carrying  text  and  graphics  overlay  data.  This  data  can  be  sent  as  a 
separate  service  type,  to  be  superimposed  on  the  display  at  the  receiver.  The  proponent  rerornmends 
that  receivers  have,  as  a  minimum,  overlay  capability  for  simple  text.  This  would  permit  the  loca 
broadcaster  to  Insert  text  of  local  importance  into  the  data  stream  containing  the  compressed  sign^ 
that  he  passes  through  from  the  network  source  to  the  viewers,  without  decompressing  and 

recompressing  the  video. 


III.  Extensibility 

A.  to  no  visible  artifacts 

AD-HDTV  has.  as  Its  core,  the  MPEG1  compression  standard  that  is  extensible  to  virtually  any 
data  rate.  The  proponent  points  out  that  AD-HDTV  at  17.7  Mbps  Is  already  an  extension  of  the 
baseline  MPEG1  parameters  which  encode  low-resolution  video  at  1.5  Mbps. 
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B.  to  studiQ-quality  dataiate 

AD-HDTV  was  designed  with  the  anticipation  of  several  levels  of  related  MPEG  compression. 
The  proponent  suggests  that  a  studio  standard  could  be  set  at  216  Mbps,  the  data  rate  of  existing 
studio  D-1  recorders.  He  believes  that  this  standard  would  be  of  extremely  high  quality  and  would 
permit  studio  processing  without  degradation. 

C.  to  higher  resolution 

The  proponent  claims  that  AD-HDTV  potentially  supports  the  delivery  of  other  video  and  image 
formats  over  appropriate  bandwidth  channels  to  special  receivers  with  Increased  memory.  The  MPEG1 
core  allows  resolutions  up  to  4095  x  4095.  Examples  given  by  the  proponent  follow.  One  is  a  1 280  x 
1024  progressive  scan,  logically  square  pixel  format  that  is  appropriate  for  computer  workstations. 
•Another  is  a  65,520  x  65,520  very-high-resolution  still-image  format  (the  limit  of  JPEG  addressing 
capability).  Even  higher  resolutions  can  be  delivered  by  extending  the  JPEG  addressing  capability. 
Such  formats  would  likely  prove  useful  in  medical  imaging,  computer  graphics,  and  space  and  defense 
applications. 

The  proponent  has  discussed  the  possibility  of  introducing  ultra-high-definition  television  by 
sending  augmentation  data  packets  assigned  to  a  unique  service  type  that  will  be  disregarded  by  older 
receivers  but  processed  by  new  receivers. 

D.  Provision  for  future  compression  enhancement 

The  proponent  suggests  improved  calculation  of  motion  vectors  and  improvements  in  bit 
allocation  and  prioritization  as  likely  means  for  picture  quality  improvement  without  changing  the 
receivers  or  the  data  rate. 
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System  Description 

The  Channel  Compatible  DigiCipher  (CCDC)  HDTV  System,  aiso  Imown  as  the  Advanced 
Television  Alliance  (ATVA)  Progressive  System,  is  an  all-digital  system  that  is  progressively  scanned 
with  787.5  lines  per  frame  and  59.94  frames  per  second.  The  display  format  has  square  pixels  with  / 
720  active  lines  by  1280  pixels  per  line  In  a  16:9  image  aspect  ratio.  Severai  types  of  digital  coding  are 
used  for  compression  and  signal  robustness.  Transmission  is  by  quadrature  amplitude  modulation  with 
4  or  5  bits  per  symbol,  offering  the  broadcaster  the  choice  of  two  data  rates  involving  a  tra^on 
between  picture  quality  and  coverage  area.  The  system  accommodates  either  16-  or  32 -QAM 
modulation.  The  proponent  emphasizes  that  the  system  was  designed  to  be  highly  modular,  so  that  the 
video  processing,  the  audio  processing,  and  the  transmission  system  can  be  used  Independently. 


I.  Interoperability 

A.  with  Cable  TV 

The  tests  of  the  proponent  systems  at  ATTC  by  Cable  Labs  should  determine  the  critical 
aspects  of  interoperability  with  cable  TV.  Other  factors  that  may  aid  cable  TV  interoperability  Include 
channel  augmentation  to  9  or  12  MHz  for  improved  picture,  or  better,  going  to  a  higher  data  rate  that 
can  be  supported  in  the  6  MHz  channel  by  the  relatively  benign  cable  environment.  The  question 
remains  whether  HDTV  receivers  can  be  built  to  support  higher  numbers  of  bits  per  symbol  without  cost 

impact.  «  «  It  •*  c 

Encryption  and  addressing  are  Important  service  features  for  Cable  TV.  See  Section  li,  items  r 

and  G. 


B.  with  digital  technology 

Since  this  system  is  all-digital,  the  advantages  of  all-digital  systems  apply. 

C.  Headers/descriotors 

A  frame  header  identifies  the  video  source  material,  the  frame  rate,  resolution,  aspect  ratio,  and 
other  system  data. 

D.  with  NTSC 

As  the  CCDC  system  is  directly  related  to  NTSC,  transcoding  to  NTSC  is  straightforward.  Edge- 
crop  conversion  to  CCIR  601  Involves  discarding  160  HD  samples  from  each  end  of  the  horizontal  line 
and  4:3  Interpolation  of  the  remaining  samples.  It  can  Involve  3:1  and  2:1  vertical  Interpolatioi^  m 
alternate  fields.  For  letter-box.  a  16.-9  horizontal  interpolation  is  required,  as  well  as  a  3:1  vertical 
Interpolation  of  the  active  lines.  For  edge-crop  display,  the  480  lines  can  be  placed  in  a  specific  set  ot 

the  483  active  lines.  The  first  (odd)  field  can  be  displayed  on  lines  21.  23 . 499,  and  the  second 

(even)  field  is  displayed  on  lines  22. 24 .  500.  The  active  lines  501  and  503  of  the  odd  fields  and  502 

of  the  even  fields  can  be  displayed  as  black.  For  letter-box  display,  the  vertical  interpolation  is  more 


9 


DRAFT 


CCDC 


complicated.  Up-conversion  from  NTSC  requires  line  tripling,  horizontal  line-rate  conversion  and 
interpolation. 

E.  with  film 

The  prototype  has  the  following  film  mode.  Film  is  displayed  with  the  three-two  pulldown 
process  for  24  frame/sec  film  and  with  simple  frame  repetition  for  30  frame/sec  film.  The  proponent 
claims  actual  frame  rates  of  59.94, 30  and  24  frames/second,  implying  that  no  film  slowdown  is  needed 
to  accommodate  the  standard  video  frame  rate,  the  encoder  automatically  detects  the  presence  of  24 
frame/sec  or  30  frames/sec  scene  material  from  film  sources.  When  a  film  source  is  detected,  an 
alternate  buffer  control  algorithm  is  used  which  takes  advantage  of  repeated  frames  in  the  source. 
With  the  scanning  method  used  in  CCDC,  only  two  out  of  each  five  TV  frames  need  to  be  sent  for  24 
frame/sec  film.  The  proponent  claims  that  the  video  quality  in  the  film  mode  is  essentially  equivalent  to 
the  uncoded  source.  He  also  claims  that  full-resolution  color  is  possible  In  the  film  mode,  though  that 
capability  is  not  in  the  prototype. 

F.  with  computers 

Progressive  scanning  and  square  pixels,  both  used  in  this  system,  are  important  factors  for 
interoperability  of  an  HDTV  system  with  computers  -  nearly  all  existing  bit-mapp)ed  computer  graphics 
displays  have  these  features.  Progressive  scanning  and  square  pixels  are  most  critical  for  real-time 
applications  such  as  display,  scan-conversion,  frame  capture,  and  video  effects.  Computers  are 
expected  to  play  an  increasing  role  in  video  image  generation  and  production,  and  it  is  desirable  to 
have  an  HDTV  format  which  allows  direct  display  and  manipulation  of  HDTV  video  on  the  computer. 
Progressive  scanning  is  preferable  for  computer  applications  to  avoid  artifacts  that  are  common  with 
interlaced  display  of  computer  generated  imagery.  Also,  scan  conversion  between  interlaced  and 
progressive  systems  can  produce  undesirable  artifacts.  Square  pixels  are  inherent  to  graphics  display 
hardware  for  ail  popular  computers  employing  bit-mapped  displays.  The  main  reason  is  that  it 
facilitates  processing  of  2D  transformations,  especially  rotations. 

G.  with  satellites 

Digital  signals  with  bit  rates  used  by  the  proponents  for  terrestrial  transmission  can  generally  be 
accommodated  in  the  available  bandwidth  of  a  satellite  transponder.  The  total  data  rate  for  the  CCDC 
system  in  the  mode  tested  by  ATTC  is  26.4  Mbps.  The  proponent  suggests  that  8-PSK  modulation 
would  permit  two  CCDC  signals  per  36  MHz  transponder.  However,  normal  transmission  by  satellite  is 
QPSK  (4-phase).  CCDC,  which  has  the  highest  data  rate  of  the  all-digital  proponents,  would  be  the 
most  difficult  to  transmit  with  two  channels  per  transponder.  Nevertheless,  using  the  19.9-  Mbps 
information  rate  of  CCDC,  Reed-Solomon  coding,  and  rate  7/8  convolutional  coding,  two  channels  can 
probably  be  transmitted  in  a  36-MHz  transponder. 

H.  with  packet  networks 

CCDC  data  is  organized  into  525  data  lines  per  frame.  These  data  lines  could  be  used  as 
packets  if  augmented  with  packet  assembly  information.  Error  concealment,  already  implemented, 
would  ensure  some  resistance  to  packet  loss.  The  proponent  has  suggested  implementation  of  a 
packetized  structure  after  field  testing. 
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I.  with  interactive  systems 

A  form  of  interactivity  important  to  cable  operators  is  the  speed  with  which  a  customer  can  scan 

through  many  channels.  This  system  is  reported  to  acquire  a  channel  in  0.4  seconds,  providing  the 

sound  and  beginning  to  build  the  picture.  The  picture  is  at  full  resolution  in  0.73  seconds. 

Latency  is  the  time  delay  between  a  video  frame  going  into  the  encoder  and  the  corresponding 
frame  coming  out  of  the  decoder  In  the  back-to-back  mode.  It  can  be  important  in  interactive 
applications.  Frame  delays  are  required  at  the  transmitter  and  at  the  receivers  for  coding  and 
decoding.  Rate  buffers  at  the  transmitter  and  at  the  receiver  cause  additional  delays.  Furthermore,  the 
system  implemented  for  test  may  not  be  representative  of  the  minimum  delay  possible.  Comparison  of 
latency  between  the  proponent  systems  is  difficult  at  this  time. 

CCDC  reports  a  video  delay  of  6  or  6  frames,  corresponding  to  83  to  100  ms.  The  exact  time  is 
said  to  depend  on  how  the  frame  buffer  is  used,  with  the  video/film  selection  a  factor. 

J.  Format  conversion 

1.  1125/60 

Upconverting  to  the  Common  Image  Format  (1920  x  1080)  is  easily  done  by  2:3  interpolation 
horizontally  and  vertically.  SMPTE  240M  currently  uses  1035  active  lines.  Colorimetry  is  SMPTE 
240M. 

2.  325Q/5Q 

Conversion  between  systems  with  different  frame  rates  is  the  most  difficult  type  of  conversion 
presently  being  done.  Digital  conversion  between  59.94  fields  per  second  and  50  fields  per  second 
requires  a  number  of  frame  stores  and  very  large  processing  capability.  Present  methods  that  involve 
frame  dropping  lead  to  jerky  motion,  but  other  techniques  can  produce  acceptable  Images  under  most 
conditions.  This  difficult  conversion  may  be  easier  from  a  progressive  source  than  from  an  interlaced 
source. 


3.  MPEG 

There  is  no  direct  compatibility  in  terms  of  bit  stream.  However,  see  the  MPEG  discussion  for 
DigiCipher. 


4.  with  still  image 

The  capture  of  still  images  from  video  is  favored  by  progressive  scan. 

K.  Scalability 

The  proponent  claims  that  the  system  is  scalable  in  the  sense  that  the  decoded  signal  may  be 
Interpolated  or  decimated  to  suit  various  displays.  Scalability  by  picture  interpolation  can  be 
implemented  in  any  proposed  system.  It  Is  simplified  by  the  progressive  scanning  In  this  system.  The 
system  may  also  be  scalable  in  the  sense  that  a  subset  of  the  bits,  such  as  those  that  comespond  to 
the  lower-order  discrete  cosine  coefficients,  could  be  used  by  lower-cost  VLSI  to  obtain  a  lower- 
resolution  signal.  However,  the  entire  bit  stream  must  be  available  for  the  subset  to  be  extracted,  and 
the  other  digital  systems  have  similar  capabilities.  Picture-in-picture  and  picture-out-of-picture  are 
handled  by  standard  methods  in  the  receiver. 
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II.  Scope  of  services  and  features 

A.  Initial  Use  of  Ancillary  Data 

Assignments  have  not  yet  been  made.  See  Data. 

B.  Audio 

The  CCDC  system  is  designed  to  have  four  channels  of  audio  using  503  kbps  or  six  channels  of 
audio  using  755  kbps.  Unused  audio  capacity  is  used  to  carry  additional  video  data.  The  audio  system 
uses  a  coding  algorithm  called  MIT  Audio  Coder,  presently  implemented  with  Motorola  DSP-96002 
floating-point  digital  signal  processors.  It  accepts  16-bit  audio  samples  at  a  rate  of  48  kHz  for  each 
channel.  All  audio  channels  are  coded  independently.  Capacity  has  been  reserved  for  dynamic  bit 
allocation  for  different  parameter  sets.  The  proponent  claims  that  this  will  permit  the  system  to 
incorporate  future  improvements  in  audio  compression  without  changing  the  receivers. 

C.  Data 

Auxiliary  and  control  data  has  been  allocated  251.8  kbps.  This  may  be  implemented  with  an 
RS-422  interface.  In  the  current  hardware,  the  only  access  to  the  auxiliary  data  channel  is  via  four 
asynchronous  9600-bps  RS-232  interfaces.  However,  the  design  can  be  modified  to  permit  data 
capacity,  not  used  by  audio,  to  be  reassigned  to  data  services. 

D.  Text 

E.  Captioning 

Teletext  and  captioning  is  sent  in  the  ancillary  data  channel.  Additional  ancillary  data  capacity 
will  be  provided  if  needed. 

F.  Encryption 

G.  Addressing 

The  first  byte  in  each  data  line  is  reserved  for  control  information,  described  as  including 
decryption  keys  and  subscriber  data.  There  are  525  data  lines  per  frame  and  59.94  frames  per  second. 
Thus,  there  are  about  252  kbps  of  capacity  for  this  kind  of  data.  Encryption  has  not  yet  been 
implemented. 

H.  Low-cost  receiver 

This  system  will  permit  lower  receiver  cost  using  lower-resolution  displays.  The  proponent 
claims  that  progressive  scan  used  in  this  system  favors  such  scalability.  The  low-cost  receiver  would 
have  to  extract  the  information  it  can  use  from  the  complete  data  stream,  perhaps  just  the  DC  DCT 
coefficients,  but  would  save  on  processing  capability  as  well  as  display  cost.  The  proponent  claims  that 
the  standard  HDTV  receiver  can  be  built  with  12  custom  VLSI's  and  2  Mbytes  of  memory. 

I.  VCR  capability 

The  proponent  reports  no  hardware  development  of  VCR's  specific  to  CCDC,  but  refers  to  the 
DigiCipher/ToshIba  VCR  that  has  been  demonstrated  within  ATVA.  The  CCDC  data  stream,  about  20 
Mbps,  is  within  the  capability  of  current  technology  for  consumer  use. 

It  is  claimed  that  a  rapid  search  mode  can  be  implemented  by  reconstojcting  the  images  from 
those  blocks  coded  with  no  temporal  predictor.  This  gives  at  least  three  displayable  frames  for  every 

12 


DRAFT 


CCDC 


60.  Additional  intra-coded  blocks  may  also  be  used  as  they  occur.  The  resultant  picture  would  have 
full  resolution,  but  may  include  artifacts.  The  reverse  playback  cannot  be  done  with  full  quality  because 
predicted  frames  cannot  be  generated.  .  «  .  ,  ,. 

Splice  and  insert  could  be  handled  by  forcing  the  receiver  to  reacquire.  Crop  and  overlay  would 

require  that  the  data  stream  be  decompressed  first.  .  ,  „  u 

Square  pixels  and  progressive  scanning  simplify  the  Implementation  of  special  effects  such  as 

zooming  and  panning. 


111.  Extensibility 

A.  to  no  visible  artifacts 

Based  on  simulation  tests,  the  proponent  believes  that  the  compression  algorithm  will  produce 
no  visible  artifacts  at  a  data  rate  of  50  Mbps,  regardless  of  the  difficulty  of  the  camera-generated  source 
material. 

B.  to  studio-Qualitv  data  rate 

According  to  the  proponent,  the  Intraframe  encoding  mode  for  the  whole  frame  can  be  used  for 
a  production  standard.  Here,  every  frame  is  encoded  without  motion  prediction.  Production-quality 
video  with  a  resolution  of  1280  x  720  can  be  stored  with  3  Mb/frame  using  the  Intraframe  compression 
method  included  in  this  system.  At  60  frames  per  second,  the  bit  rate  is  180  Mbps,  an  acceptable  rate 
for  studio  use.  The  proponent  claims  that  the  frame  can  be  decoded  and  re-encoded  many  times  with 
little  degradation. 

C.  to  higher  resolution 

Currently  the  system  is  designed  to  display  1280  x  720  image  sequences,  but  larger  sizes  can 
be  specified  as  part  of  the  frame  header. 

D.  Provision  for  future  compression  enhancement 

The  compression  algorithm  can  be  Improved  by  performing  better  motion  estimation,  and 
including  better  perceptual  criteria  at  the  transmitter.  These  involve  no  changes  at  the  receiver. 
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DidCiPher  HDTV  Interoperability  Assessment 


System  Description 

Also  known  as  the  Advanced  Television  Alliance  (ATVA)  Interlaced  system,  the  DigiCipher 
HDTV  system  is  an  all-digital  system  with  1050  scan  lines  per  frame,  59.94  fields  per  second,  and  2:1 
Interlace.  The  display  format  has  a  16:9  image  aspect  ratio  with  960  lines  per  frame  and  1408 
pixels/line.  Several  t^es  of  digital  coding  are  used  for  compression  and  signal  robustness.  The 
primary  picture  coding  is  discrete  cosine  transform  with  motion  compensation.  Transmission  is  by 
quadrature  amplitude  modulation  with  4  or  5  bits  per  symbol,  offering  the  broadcaster  the  choice  of  two 
data  rates  involving  a  tradeoff  between  picture  quality  and  coverage  area.  The  system  accommodates 
either  16-  or  32-QAM  modulation. 

I.  Interoperability 

A.  with  Cable  TV 

The  tests  of  the  proponent  systems  at  ATTC  by  Cable  Labs  should  determine  the  critical 
aspects  of  interoperability  with  cable  TV.  Other  factors  that  may  aid  cable  TV  interoperability  include 
channel  augmentation  to  9  or  12  MHz  for  improved  picture,  or  better,  going  to  a  higher  data  rate  that 
can  be  supported  in  the  6  MHz  channel  by  the  relatively  benign  cable  environment.  The  proponent 
states  that  it  is  likely  that  a  rate  higher  than  32-QAM  will  be  used  for  multichannel  NTSC,  and  has 
expressed  the  Intent  to  maintain  compatibility  between  DigiCipher  Multichannel  NTSC  and  HDTV.  The 
question  remains  whether  HDTV  receivers  can  be  built  to  support  higher  numbers  of  bits  per  symbol 
without  cost  impact.  The  proponent  has  conducted  demonstrations  of  one-way  and  two-way  operation 
over  existing  cable  TV  plants. 

Encryption  and  addressing  are  also  important  service  features  for  Cable  TV.  See  Section  11, 
items  F  and  G. 

B.  yyilh  digital  technology 

Since  this  system  is  all-digital,  the  advantages  of  all-digital  systems  apply.  The  proponent  has 
agreed  to  determine  the  feasibility  of  a  common  standard  between  the  DigiCipher  digital  bus  and  the 
EIA  CE  bus. 

c.  Headers/descriptors 

The  proponent  discussed  the  use  of  the  andllary  data  space  for  transmitting  the  program  name, 
remaining  times  and  program  rating,  in  the  prototype,  there  is  a  7-byte  header  at  the  beginning  of  each 
data  frame.  Four  bydes  are  used,  and  three  bytes  are  available.  There  is  a  one-byte  header  at  the 
beginning  of  each  video  frame,  of  which  one  bit  Is  available.  There  is  a  fully  defined  two-byte  header  at 
the  beginning  of  each  macroblock.  In  his  certification  disclosure,  the  proponent  discussed  his  intention 
to  implement  adaptive  allocation  between  video,  audio  and  ancillary  data.  The  proponent  has  now 
announced  an  improvement  Involving  expansion  of  the  video  frame  and  macroblock  headers,  and 
packetization  at  the  transwitch  layer.  Transwitch  layer  packetization  makes  the  data  allocations 
adaptive,  allowing  flexible  allocation  of  capacity  among  video,  audio,  and  data  applications.  The 
proponent  states  that  this  modified  data  format  will  be  implemented  in  the  protot^e  prior  to  field 
testing. 
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D.  with  NTSC 

The  proponent  selected  the  field  rate  of  59.94  Hz  for  compatibility  with  NTSC.  The  number  of 
active  video  lines,  960,  was  selected  to  be  double  the  number  of  active  NTSC  lines,  480.  (NTSC 
normally  has  483  active  video  lines,  but  483  is  an  awkward  number  for  conversions.)  Since  the  aspect 
ratio  of  this  system,  16:9,  is  different  from  the  aspect  ratio  of  NTSC,  4:3,  a  choice  must  be  made  for  the 
'  conversion. .  The  limiting  choices  for  down-conversion  are  1)  "Edge  Crop",  in  which  the  HDTV  picture 
fills  480  lines  on  NTSC  with  loss  of  the  sides  of  the  HDTV  picture,  and  2)  "Letterbox",  in  which  the  full 
width  of  the  HDTV  picture  is  displayed  in  the  full  width  of  the  NTSC  picture,  but  leaving  unused  areas  at 
the  top  and  the  bottom  of  the  NTSC  screen.  Pixel  values  from  HDTV  lines  are  stored  in  memory  and 
read  out  at  reduced  speed  to  make  NTSC  lines.  The  range  of  pixels  read  and  the  clocking  rate 
depend  upon  the  choice  of  edge-crop  or  letterbox.  The  down-conversion  involves  interpolation 
between  HDTV  pixels  in  a  line  and  between  HDTV  lines. 

E.  with  film 

The  prototype  system  accepts  film  shot  at  24  frames  per  second  as  59.94-Hz  video,  2:1 
interlaced,  having  been  converted  by  the  three-two  pulldown  technique.  The  DigiCipher  encoder 
recognizes  the  redundancy  in  each  five-field  sequence  as  having  originated  in  24-frame  film,  and 
converts  the  59.94-field  video  back  to  23.98  frames/second.  The  Image  is  processed  and  transmitted 
as  23.98  frame  progressive,  and  converted  back  to  59.94-field  interlace  in  the  decoder,  using  3:2  pull¬ 
down.  Future  receivers  could  alternatively  use  3:1  frame  repeat  to  display  progressive  at  72  Hz. 
Similarly,  30-frame  film  source  which  is  delivered  to  the  encoder  as  59,94-field  video  is  processed  and 
transmitted  as  29.97-frame  progressive.  The  benefit  is  more  efficient  coding,  and  thus  higher  quality. 
The  proponent  has  proposed  a  system  Improvement  which  would  allow  receiving  and  processing 
images  directly  in  24-  and  30-frame  progressive.  He  also  suggests  that  in  the  future,  spatial  resolution 
could  be  increased  as  temporal  resolution  is  decreased. 

F.  with  computers 

Progressive  scanning  and  square  pixels,  not  included  in  the  DigiCipher  system  tested,  are 
imp)ortant  factors  for  Interoperability  of  an  HDTV  system  with  computers  —  nearly  all  existing  bit-mapped 
computer  graphics  displays  have  these  features.  Progressive  scanning  and  square  pixels  are  most 
critical  for  real-time  applications  such  as  display,  scan-conversion,  frame  capture,  and  video  effects. 
Computers  are  expected  to  play  an  increasing  role  in  video  image  generation  and  production  and  It  is 
desirable  to  have  an  HDTV  format  which  allows  direct  display  and  iroriipulation  of  HDTV  video  on  the 
computer.  Progressive  scanning  is  preferable  for  computer  applications  to  avoid  artifacts  that  are 
common  with  interlaced  display  of  computer  generated  Imagery.  Also,  scan  inversion  between 
interlaced  and  progressive  systems  can  produce  undesirable  artifacts.  Square  pixels  are  inherent  to 
graphics  display  hardware  for  all  popular  computers  employing  bit-mapped  displays.  The  main  reason 
is  that  it  facilitates  processing  of  2D  transfonnations,  especially  rotations.  Non-square  pixels  do  not 
present  a  problem  for  the  display  of  RGB  signals  from  a  computer's  video  card  on  HDTV  receivers,  but 
would  complicate  more  sophisticated  attempts  at  Interoperability,  such  as  the  display  by  an  HDTV 
receiver  of  a  picture  from  a  digital  data  stream  generated  by  a  computer.  This  system  has  pixels  that 

are  21%  wider  than  high.  ^  . 

The  prototype  hardware  was  built  to  select  between  field  processing  and  frame  processing  tor 
each  superblock,  depending  upon  its  motion,  in  order  to  provide  optimum  motion  handling.  However, 
computer  interoperability  would  be  enhanced  if  the  encoder  were  forced  to  do  frame  processing  on  all 
superblocks.  With  this  feature,  coding  and  transmission  would  be  In  progressive  form.  The  proponent 
has  proposed  adding  this  feature  as  an  option  at  the  encoder. 
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G.  with  satellites 

Satellite  transmission  of  the  DigiCipher  HDTV  signal  was  recently  demonstrated  using  QPSK  in 
a  24-MHz  bandwidth,  achieving  a  raw  data  rate  of  39  Mbps.  Instead  of  the  trellis  coding  used  in  the 
terrestrial  system,  convolutional  coding  with  a  Viterbi  decoder  was  used.  The  coding  was  rate-1 /2,  so 
that  the  data  rate  after  Viterbi  decoding  was  19.51  Mbps.  Reed-Solomon  coding  was  also  used,  with 
the  information  rate  being  18.2  Mbps,  identical  to  the  terrestrial  signal.  A  5.5  dB  C/N  threshold  was 
achieved;  an  improvement  over  the  8+  dB  threshold  typically  achieved  in  NTSC  satellite  transmission. 
The  proponent  recommends  using  rate-3/4  coding  to  yield  a  50%  increase  in  the  information  rate.  This 
would  support  a  higher-level  compressed  HDTV  signal  or  an  NTSC  signal  sharing  the  channel  with  the 
HDTV  signal.  In  a  36-MHz  transponder,  two  transmission-quality  HDTV  signals,  or  alternatively,  one 
distribution-quality,  40-45  Mbps  signal  can  be  transmitted. 

H.  with  packet  networks 

In  the  prototype  tested  at  ATTC,  the  video  data  was  packetized,  with  each  packet,  about  2  kb 
long,  containing  one  macroblock  of  video  data.  For  lost  packets,  the  decoder  would  have  used  error 
concealment  which  was  already  implemented  to  handle  transmission  errors. 

The  proponent  has  since  offered  a  redesign  of  the  data  structure  with  headers/descriptors.  The 
benefits  of  this  approach  include  improved  packet  transmission.  Packets  will  be  orgariized  by  data 
type.  Packet  length  will  be  155  bytes,  with  the  payload  being  141  bytes.  Out-of-order  packets  can  be 
reordered  in  a  suitably  designed  receiver.  Implementation  is  expected  after  field  testing. 

I.  with  interactive  systems 

A  form  of  Interactivity  important  to  cable  operators  is  the  speed  with  which  a  customer  can  scan 
through  many  channels.  This  system  is  reported  to  acquire  a  channei  in  0.4  seconds,  providing  the 
sound  and  beginning  to  build  the  picture.  The  picture  is  at  full  resolution  in  0.77  seconds. 

Latency  is  the  time  delay  between  a  video  frame  going  into  the  encoder  and  the  corresponding 
frame  coming  out  of  the  decoder  in  the  back-to-back  mode.  It  can  be  important  in  interactive 
applications.  Frame  delays  are  required  at  the  transmitter  and  at  the  receiver  for  coding  and  decoding. 
Additional  delay  may  be  needed  to  facilitate  frame  coding  in  an  interlaced  system.  Rate  buffers  at  the 
transmitter  and  at  the  receiver  also  cause  delays.  Furthermore,  the  system  implemented  for  test  may 
not  be  representative  of  the  minimum  delay  possible.  Comparison  of  latency  between  the  proponent 
systems  is  difficult  at  this  time. 

According  to  the  proponent,  the  latency  of  DigiCipher  is  83  msec. 

J.  Format  conversion 

1.  with  1125/60 

Up-converting  to  the  Common  Image  Format  (1920  x  1080)  requires  a  8:9  vertical  interpolation 
from  1408  x  960.  SMPTE  240M  currently  uses  1035  active  lines.  Conversion  from  1408  pixels/line  to 
1920  pixels/line  requires  an  11-to-15  interpolation.  The  proponent  says  that  15  interpolation  filters  are 
required,  each  having  5-6  taps.  Colorimetry  used  by  DigiCipher  is  the  same  as  SMPTE  240M. 

2.  with  125Q/5Q 

Conversion  between  systems  with  different  frame  rates  is  the  most  difficult  type  of  conversion 
presently  being  done.  Digital  conversion  between  59.94  fields  per  second  and  50  fields  per  second 
requires  a  number  of  frame  stores  and  very  large  processing  capability.  Present  methods  that  involve 
frame  dropping  lead  to  jerky  motion,  but  other  techniques  can  produce  acceptable  images  under  most 
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conditions.  This  difficult  conversion  is  not  simplified  by  the  fact  that  both  the  source  system  and  the 
target  system  are  interlaced  2:1 . 

3.  with  MPEG 

The  DigiCipher  HD  decoder  would  require  modification  to  decode  MPEG  or  H.261.  The 
proponent  states  that  there  would  be  only  a  modest  increase  in  complexity  because  DigiCipher  shares 
many  commonalties  with  MPEG  and  H.261 .  MPEG  and  H.261  decoders  will  not  decode  DigiCipher  HD. 

4.  with  still  image 

The  proponent  has  identified  still-frame  as  a  useful  capability,  and  believes  that  forward 
compatibility  with  JPEG,  Photo  CD  and  CDI  is  feasible  and  Is  a  marketplace  issue.  The  frame-coding 
option  offered  by  the  proponent  (See  l.F)  enhances  compatibility  with  still  images. 

K.  Scalability 

Though  the  receive  and  display  clocks  are  currently  linked,  the  proponent  proposes  to  operate 
them  independently  in  the  future.  The  receiver  could  then  receive  non-real-time  video  at  slower  rates. 

According  to  the  proponent,  picture-in-picture  and  picture-out-of-picture  are  possible  with 
DigiCipher  as  receiver  design  options. 

L.  Other  Compatibility  Features 

DigiCipher  HD  processes  the  image  in  four  parallel  panels.  Each  panel  processor  is  comparable 
to  a  DigiCipher  NTSC  processor  and  thus  is  able  to  process  a  DigiCipher  NTSC  signal.  There  Is  also  a 
compatible  bus  that  can  support  both  NTSC  and  HD  signals.  The  proponent  claims  that  the 
compatibility  extends  to  VCR's,  and  satellite  and  cable  receivers. 


II.  Scope  of  services  and  features 

A.  Initial  Use  of  Ancillary  Data 

The  use  of  the  ancillary  data  capacity  is  not  yet  specified. 

B.  Audio 

The  DigiCipher  system  provides  for  four  independent  audio  channels,  each  sampled  at  47.2  kHz 
(to  be  changed  to  48  kHz)  and  digitized  to  16-bit  precision.  The  system  uses  two  Dolby  AC-2 
compression  systems  operating  at  24-bit  precision.  The  two  compressed  audio  data  streams  are 
formatted  along  with  a  1200-bps  control  signal  into  a  single  serial  output  data  stream  at  503  kbps,  to  be 
multiplexed  into  the  transmitted  signal.  The  system  description  discusses  extra  error  protection  within 
the  Dolby  AC-2  decoder.  This  leads  to  audio  that  remains  acceptable  under  signal  conditions  where 

the  picture  fails.  _  „  ^  j  c<  .,h.,«noi 

The  proponent  has  proposed  to  incorporate  the  Dolby  AC-3  composite-coded  5.1  channe 

surround-sound  system  into  the  prototype  prior  to  field  testing.  This  system  offers  a  variety  of  modes  of 
operation.  Including  dual  independently  coded  AC2A  channels.  The  proponent  also  suggeste  that  the 
use  of  packetization  will  permit  potentially  numerous  composite-coded  and  independently  coded  audio 
channels  to  be  transmitted,  with  the  receiver  determining  which  to  process. 
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C.  Data 

A  126-kbps  channel  is  provided  for  ancillary  data.  This  is  distinguished  from  a  separate  126- 
kbps  channel  assigned  for  conditional  access  use.  The  planned  incorporation  of  packetization  at  the 
transwitch  layer  will  allow  the  system  to  flexibly  allocate  transmission  capacity  to  data. 

D.  Text 

In  the  prototype,  four  data  channels  at  9600  bps  were  implemented  to  illustrate  asynchronous 
data  transmission. 

E.  Captioning 

The  proponent  has  stated  an  intention  to  demonstrate  closed-captioning  compatibility  when 
prototype  captioning  hardware  becomes  available  in  mid-1993. 

F.  Encryption 

Encryption  was  not  implemented  in  the  prototype.  However,  the  proponent  claims  to  have 
developed  “smart  card*  security  for  both  VideoCipher  and  DigiCipher.  With  packetization  by  data  type, 
the  different  types  of  data  will  be  encrypted  separately. 

G.  Addressing 

A  126-kbps  channel  is  assigned  for  conditional  access  use  (subscriber  addressing),  separate 
from  the  126-kbps  channel  for  auxiliary  data.  The  proponent  suggests  that  with  this  data  channel,  50- 
100  million  receivers  could  be  addressed  in  less  than  one  day.  The  proponent  offers  to  implement  fully 
packetized  data  that  will  allow  flexible  allocation  of  data  with  increased  peak-load  subscriber-addressing 
capability. 

H.  Low-cost  receiver 

The  proponent  suggests  a  low-cost  receiver  that  displays  176  x  120  pixels  by  extracting  and 
processing  only  the  DCT  DC  coefficients.  Picture  quality  Is  suggested  to  be  comparable  to  that  of 
NTSC  on  a  2“  LCD  display.  Extracting  the  DCT  DC  coefficients  for  a  low-cost  receiver  can  be 
implemented  in  any  of  the  proposed  digital  systems. 

L  VCR  capability 

A  consumer-grade  VCR  has  been  publicly  exhibited  by  Gl  and  Toshiba.  It  records  a  digital 
signal  at  the  18.2  Mbps  information  rate  of  compressed  DigiCipher.  It  uses  two-hour  metallized-tape 
cassettes,  similar  in  format  to  8-mm  NTSC  cassettes. 

The  proponent  reports  running  simulations,  showing  that  a  full  set  of  trick  mode  features  can  be 
supported.  The  DigiCipher  VCR  uses  PCM  refresh  data  from  each  field,  and  attempts  to  use  DPCM 
data  too. 

Switching  between  compressed  video  images  should  be  done  at  frame  sync,  preferably  with  the 
new  scene  either  at  black  or  at  a  scene  change  when  the  image  Is  being  processed  in  the  PCM  mode. 
Switching  within  a  frame  may  be  done  at  the  macroblock  level,  with  some  restrictions.  Othenwise, 
editing  during  frames  requires  decompression  and  recompression,  with  a  small  loss  in  quality  do  to 
concatenation.  However,  it  Is.  anticipated  that  most  editing  will  be  done  prior  to  compression  to  the 
transmission  rate. 

No  provisions  have  been  reported  for  special  effects  in  the  system  tested. 
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III.  Extensibility 

A.  to  no  visible  artifacts 

The  proponent  reports  simulating  compression  at  30  Mbps  with  favorable  results.  He  believes 
that  the  algorithm  can  be  extended  to  40-45  Ktops,  constituting  a  distribution  level  of  quality  suitable  for 
network  feeds  to  local  affiliates.  The  proponent  has  announced  that  he  is  investigating  an  approach 
which  would  allow  the  transmission-level  signal  to  be  included  in  the  distribution  level  signal  as  a  kernel. 
This  would  permit  pass-through  of  the  transmission-level  signal  at  the  local  affiliate  level  by  stripping 
away  the  distribution-level  augmentation. 

B.  to  studio-oualitv  data  rate 

The  proponent  speculates  that  studio-quality  intraframe  compression  can  be  achieved  at  a  bit 
rate  in  the  100-200  Mbps  range.  This  format  has  not  been  developed  yet. 

C.  to  higher  resolution 

The  proponent  believes  that  DigiCipher  technology  is  extensible  and  suggests  a  resolution 
increase  by  a  factor  of  about  four. 

D.  Provision  for  future  compression  enhancement 

The  proponent  suggests  that  as  decreasing  digital  processing  costs  enable  increasing 
complexity  at  the  encoder,  improvements  can  be  made  without  changing  the  receivers  or  the 
transmitted  bit  rate.  These  improvements  are  in  forward  anaiysis,  perceptual  analysis,  motion 
compensation,  coefficient  quantization,  and  special  effects  editing. 
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System  Description 

The  Digital  Spectrum  Compatible  HDTV  System  is  an  all-digital  system  that  is  progressively- 
scanned  with  787.5  lines  per  frame  and  59.94  frames  per  second.  The  display  format  has  square 
pixels  with  720  lines  by  1280  pixels  per  line  in  a  16:9  image  aspect  ratio.  A  feature  of  the  system  is 
dual-rate  coding  where  each  data  segment  is  assigned  a  priority.  The  most  important  picture 
information  on  a  scene-by-scene  basis  is  transmitted  as  two-level  vestigial  sideband  (2-VSB)  while  the 
remaining  picture  information  is  transmitted  as  4-VSB.  This  allows  a  degree  of  graceful  degradation  on 
the  edge  of  the  broadcast  contour  and  transmission  robustness  within  the  service  area.  It  also  allows  a 
degree  of  bit-stream  scalability.  It  implies  a  picture-dependent  variable  information  rate,  achieved  with 
a  constant  symbol  rate.  A  pilot  carrier  is  used  for  easier  signal  acquisition  because  the  pilot  carrier  is  in 
a  part  of  the  band  where  co-channel  NTSC  receivers  are  not  sensitive. 

I.  Interoperability 

A.  with  Cable  TV 

The  tests  of  the  proponent  systems  at  ATTC  by  Cable  Labs  should  determine  the  critical 
aspects  of  interoperability  with  cable  TV.  Other  factors  that  may  aid  cable  TV  interoperability  include 
channel  augmentation  to  9  or  12  MHz  for  improved  picture,  or  better,  going  to  a  higher  data  rate  that 
can  be  supported  in  the  6  MHz  channel  by  the  relatively  benign  cable  environment.  The  proponent 
claims  the  hardware  implementation  of  a  16-VSB  transmission  format  to  achieve  a  43  Mbps  data  rate, 
which  can  deliver  two  DSC-HDTV  programs  over  a  single  cable  channel  without  any  perceptible 
Increase  in  the  cost  of  terrestrial  broadcast  receivers. 

Encryption  and  addressing  are  important  service  features  for  Cable  TV.  See  Section  II,  Items  F 

and  G. 


B.  with  digital  technoloov 

Since  this  system  is  ail-digital,  the  advantages  of  all-digital  systems  apply. 

C.  Headers/descriptors 

The  prototype  tested  did  not  have  explicit  headers  and  descriptors.  However,  ancillary  data 
space  was  provided  for  a  number  of  purposes  including  headers/descriptors.  A  data  format  is  presently 
under  development  that  includes  a  minimum  of  fixed  data.  Within  that  fixed  data  is  a  location  system 
header  that  specifies  the  use  of  the  remaining  data.  The  use  of  data  is  flexible,  and  the  header  deals 
with  error  correction  and  priority  transmission  of  the  data.  Detailed  information  was  provided  to  the 
Review  Board. 

D.  with  NTSC 

As  the  Digital  Spectrum  Compatible  HDTV  system  is  directly  related  to  NTSC,  transcoding  to 
NTSC  Is  straightforward.  Conversion  to  and  from  NTSC  has  been  demonstrated  using  real-time 
hardware.  Edge-crop  conversion  to  CCIR  601  involves  discarding  1 60  HD  samples  from  each  end  of 
the  horizontal  line  and  4:3  Interpolation  of  the  remaining  samples.  It  can  involve  3:1  and  2:1  vertical 
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interpolations  in  alternate  fields.  For  letter-box,  16:9  horizontal  interpolation  is  required,  as  well  as  3:1 
vertical  interpolation  of  the  active  lines.  For  edge-crop  display,  the  480  lines  can  be  placed  in  a  specific 

set  of  the  483  active  lines.  The  first  (odd)  field  can  be  displayed  on  lines  21,  23 .  499,  and  the 

second  (even)  field  Is  displayed  on  lines  22, 24, ....  500.  The  active  lines  501  and  503  of  the  odd  fields 
and  502  of  the  even  fields  can  be  displayed  as  black.  For  letter-box  display,  the  vertical  interpolation  is 
more  complicated.  Up-conversion  from  NTSC  requires  line  tripling,  horizontal  line-rate  conversion  and 
interpolation. 

E.  with  film 

The  encoder  buffer  control  automatically  detects  the  presence  of  24-frame/second  or  30- 
frame/second  scene  material  from  film  sources.  When  a  film  source  is  detected,  an  alternate  buffer 
control  algorithm  is  used  which  takes  advantage  of  repeated  frames  in  the  source  and  minimizes 
variations  In  distortion  between  repeated  frames.  If  film  is  detected,  all  video  segments  undergo  two- 
level-mode  transmission  for  maximum  coverage  area  and  minimum  video  data  rate.  The  current 
Implementation  of  the  alternate  buffer  control  for  film  mode  was  not  completed  in  time  for  testing  at  the 
ATTC.  However,  it  has  since  been  completed  and  demonstrated. 

F.  with  computers 

Progressive  scanning  and  square  pixels,  both  used  in  this  system,  are  important  factors  for 
interoperability  of  an  HDTV  system  with  computers  -  nearly  all  existing  bit-mapped  computer  graphics 
displays  have  these  features.  Progressive  scanning  and  square  pixels  are  most  critical  for  real-time 
applications  such  as  display,  scan-conversion,  frame  capture,  and  video  effects.  Computers  are 
expected  to  play  an  increasing  role  in  video  image  generation  and  production,  and  it  is  desirable  to 
have  an  HDTV  format  which  allows  direct  display  and  manipulation  of  HDTV  video  on  the  computer. 
Progressive  scanning  is  preferable  for  computer  applications  to  avoid  artifacts  that  are  common  with 
interlaced  display  of  computer  generated  imagery.  Also,  scan  conversion  between  Interlaced  and 
progressive  systems  can  produce  undesirable  artifacts.  Square  pixels  are  Inherent  to  graphics  display 
hardware  for  all  popular  computers  employing  bit-mapped  displays.  The  main  reason  Is  that  it 
facilitates  processing  of  2D  transformations,  especially  rotations.  It  also  helps  format  conversion  and 
extension  to  higher  resolution. 

G.  with  satellites 

Digital  signals  with  bit  rates  used  by  the  proponents  for  terrestrial  transmission  can  generally  be 
accommodated  in  the  available  bandwidth  of  a  satellite  transponder.  The  maximum  total  data  rate  for 
the  DSC-HDTV  system  in  the  mode  tested  by  ATTC  is  21.5  Mbps.  As  satellite  data  communication 
channels  use  a  constant  bit  rate,  the  variable  bit  rate  used  by  DSC  for  terrestrial  transmission  makes  it 
necessary  for  the  bit  stream  to  be  reformatted  for  satellite  transmission.  The  reformatted  bit  stream 
must  contain  the  data  needed  to  pemilt  reconstruction  of  the  variable-rate  bit  stream  for  VSB-2/4 
terrestrial  modulation.  Nevertheless,  DSC-HDTV  has  the  lowest  data  rate  of  any  of  the  all-digital 
proponents.  The  proponent  has  suggested  two-channel  TDM  and  two-channel  SCPC  in  a  36-Mhz 
transponder  and  two  and  one  channel  DBS  scenarios. 

H.  with  packet  networks 

The  DSC  symbols  are  organized  in  data  segments  that  resemble  packets.  The  segments  make 
up  data  frames  of  duration  1/59.94  sec.  In  order  to  carry  DSC  on  an  ATM  network,  the  data  in  data 
frames  would  be  encapsulated  in  the  ATM  cell  structure.  However,  circuit-switched  network  use 
constant  bit  rate,  while  the  number  of  bits  in  a  data  frame  varies  because  of  the  two-level  transmission. 
The  proponent  suggests  repeating  two-level  segments  for  added  robustness  to  fill  out  the  data  stream 
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for  a  constant-bit-rate  channel.  For  a  packet  network,  packets  can  be  used  as  needed  to  carry  the 
lesser  actual  bit  rate. 

When  ceil  loss  Is  detected,  the  decoder  performs  error  concealment  by  replacing  missing 
segments  with  default  data  or  with  pixel  data  from  a  previous  frame.  The  error  concealment  algorithm 
was  oniy  partially  implemented  for  ATTC  testing,  but  has  since  been  fully  implemented. 

I. .  with  ■interactive.systems 

The  picture  achieves  full  quality  within  a  few  frames  after  a  scene  change.  Full  quality  is 
obtained  within  one  second  after  a  channel  change. 

Latency  is  the  time  delay  between  a  video  frame  going  into  the  encoder  and  the  corresponding 
frame  coming  out  of  the  decoder  in  the  back-to-back  mode.  It  can  be  important  in  interactive 
applications.  Frame  delays  are  required  at  the  transmitter  and  at  the  receivers  for  coding  and 
decoding.  Rate  buffers  at  the  transmitter  and  at  the  receiver  cause  additional  delays.  Furthermore,  the 
system  implemented  for  test  may  not  be  representative  of  the  minimum  delay  possible.  Comparison  of 
latency  between  the  proponent  systems  is  difficult  at  this  time. 

The  proponent  claims  that  the  delay  through  the  encoder  and  decoder  for  the  DSC-HDTV 
system  is  about  14  frames  (224  ms).  An  enhancement  to  the  current  system  allows  the  latency  to  be 
determined  by  the  encoder,  in  case  lower  latency  is  required  for  interactive  applications. 

J.  Format  conversion 

1.  with  1125/60- 

Up-converting  to  the  Common  Image  Format  (1920  x  1080)  Is  easily  done  by  2:3  interpolation 
horizontally  and  vertically.  SMPTE  240M  currently  uses  1035  active  lines.  Colorimetry  is  SMPTE 
240M. 


2.  with  1250/50 

Conversion  between  systems  with  different  frame  rates  is  the  most  difficult  type  of  conversion 
presently  being  done.  Digital  conversion  between  59.94  fields  per  second  and  50  fields  per  second 
requires  a  number  of  frame  stores  and  very  large  processing  capability.  Present  methods  that  involve 
frame  dropping  lead  to  jerky  motion,  but  other  techniques  can  produce  acceptable  images  under  most 
conditions.  This  difficult  conversion  is  somewhat  easier  with  a  progressive  system  such  as  DSC  than 
with  an  interlaced  system. 

3.  with  CCIR  601/60 

The  proponent  claims  that  conversion  to  and  from  CCIR  601/60  has  been  demonstrated  with 
excellent  results.  Commercially  available  line  doublers  were  used  to  convert  the  CCIR  601  to 
progressive  form,  followed  by  a  spatial  interpolation. 

4.  with  MPEG 

Although  the  DSC-HDTV  decoder  shares  many  commonalties  with  MPEG1  decoders,  the  DSC- 
HDTV  decoder  would  require  modification  to  decode  MPEG1 . 

5.  with  still  image 

The  proponent  suggests  that  conversions  with  JPEG,  Photo  CD  and  CDl  are  possible  with 
straightforward  spatial  filtering  after  decompression,  without  the  temporal  effects  that  might  be 
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introduced  in  an  interlaced  TV  format.  In  simple  cases,  line  and  sample  doubling  or  sub-sampling  will 
suffice. 


K.  Scalability 

It  is  possible  to  process  the  two-level  data  only  and  display  the  images  wrresponding  to  that 
portion  of  the  video  information.  Film  scenes  and  many  normal  video  scenes  will  use  almost  entirely 
two-level  coding  for  video,  so  the  two-level  data  will  result  In  virtually  full-quality  video.  The  decoding  of 
only  the  two-level  data  may  result  in  a  reduced-quality  image  for  scenes  that  are  difficult  to  encode  and 
that  therefore  require  substantial  four-level  data.  In  such  a  case,  the  loss  of  the  four-level  data  affects 
both  the  temporal  and  spatial  resolution.  The  error-concealment  algorithm  repiaces  lost  slices  with 
information  from  a  previous  frame.  Another  spatial  scaling  method  would  produce  a  lower  resolution 
picture  by  extracting  a  lowest  frequency  subset  of  the  DCT  coefficients  from  each  compressed  slice. 
Extraction  of  only  the  DC  coefficients  would  produce  a  160H  x  90V  pixel  image.  Scalability  by 
extracting  DC  (or  other  low-order)  DCT  coefficients  can  be  implemented  in  any  proposed  digital  system. 
Where  temporal  scaling  is  needed,  the  process  is  simplified  by  the  progressive  scan  used  in  DSC.  The 
proponent  has  suggested  using  the  motion  vectors  available  at  the  decoder  to  perform  motion- 
compensated  frame  interpolation. 

The  proponent  suggests  that  picture-in-picture  be  done  by  windowing  on  slice  (64H  x  48V 
pixels)  boundaries. 


11.  Scope  of  services  and  features 

A.  Initial  Use  of  Ancillary  Data 

The  proponent  has  provided  charts  showing  a  way  in  which  flexibly  allocated  data  may  be  used 
by  means  of  headers.  See  Section  I.C,  Headers/Descriptors. 

B.  Audio 

The  DSC-HDTV  system  provides  four  independent  125,874  bps  audio  channels  multiplexed  with 
the  video  data.  The  prototype  hardware  used  the  Dolby  AC-2  compression  equipment.  The  main 
stereo  pair  is  transmitted  as  two-level  VSB,  whereas  the  secondary  pair  is  transmitted  as  four-level 
VSB.  The  two-level  data  has  a  6  dB  lower  threshold  at  the  fringe  area. 

The  newly  proposed  use  of  headers  provides  a  flexible  composite  surround-sound  method, 
described  In  ATSC  document  T3/186,  In  place  of  the  original  fixed  allocation.  It  also  provides  a 
separate  stereo  pair  suitable  for  a  second  language. 

C.  Data 

Two  separate  channels  were  provided  for  ancillary  data  In  the  system  tested.  The  total  capacity 
of  412,867  bps  was  divided  Into  one  channel  of  30,210  bps  sent  as  2-level  data  and  another  of  382,657 
bps  sent  as  4-level  data.  However,  the  newly  proposed  header  usage  allows  a  much  more  flexible 
assignment  of  andllary  data  capability.  Assuming  the  presence  of  video,  five-channel  audio,  and 
captioning,  and  that  all  the  ancillary  data  Is  carried  as  four-level  data,  the  ancillary  data  rate  is 
493,426.57  bps. 

D.  Text 

The  technical  description  dated  2/22/91  suggested  that  35  bytes  per  data  field  or  16,783  bps  be 
used  for  teletext.  The  new  proposal  would  flexibly  allocate  text  data  by  use  of  headers  and  descriptors. 
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E.  Captioning 

The  technical  description  dated  2/22/91  suggested  that  1  byte  per  data  field  or  480  bps  be  used 
for  captioning.  The  new  proposal  would  allocate  9.6  kbps  per  ATSC  Document  T3/186. 

F.  Encryption 

The  prototype  tested  did  not  have  encryption  implemented.  The  proponent  expects  to  develop 
encryption  with  industry  participation. 

G.  Addressing 

The  use  of  headers  under  development  allows  program  quality  to  be  traded  temporarily  for 
addressing  performance.  An  extreme  mode  of  the  system  in  which  most  of  the  data  rate  is  dedicated  to 
addressing  would  allow  addressing  of  more  than  5.7  million  subscribers  per  minute.  In  this  mode,  other 
auxiliary  data  would  be  suppressed,  and  a  minimum  quality  of  audio/video  service  would  be  maintained. 

H.  Low-cost  receiver 

The  proponent  described  an  algorithm  for  producing  a  640H  x  360V  pixel  image  by  extracting 
one  out  of  four  8x8  blocks  of  pixels  and  tiling  them.  The  resulting  picture  is  said  to  exceed  the  quality  of 
NTSC  on  a  2“  LCD  display.  It  is  also  reported  possible  to  make  a  low-cost  receiver  with  just  the  two- 
level  data. 

I.  VCR  capability 

The  proponent  claims  that  the  current  S-VHS  mechanism  is  sufficient  for  the  21.5  Mbps 
compressed  DSC-HDTV  data,  and  that  such  1/2  Inch  cassette  equipment  existe  In  prototype  form. 

According  to  the  proponent,  the  system  knows  what  fraction  of  the  original  image  Is  contained  in 
the  displaced  frame  difference  (DFD).  A  usable  picture  is  obtained  without  motion  compensation  by 
amplifying  the  DFD  by  a  factor  proportional  to  the  inverse  of  the  DF  factor.  This  can  be  used  for  VCR 
forward  or  reverse  scan  modes  when  only  a  small  portion  of  each  compressed  frame  is  acquired.  In 
addition,  the  segment  headers  are  needed  to  identify  the  slice  numbers  from  fte  acquired  data.  The 
picture  would  appear  "blocky"  with  some  slices  lost,  but  suitable  for  rapid  searching. 

Still  frame  is  simple  if  the  VCR  had  been  playing.  If  random  access  to  a  particular  fr^e  on  the 
tape  is  required,  the  decoding  of  several  frames  leading  up  to  It  is  needed  to  achieve  full  quality. 

Splicing  is  optimal  if  each  splice  starts  with  a  scene  change.  Otherwise,  the  decoder  can  be 
signaled  to  initiate  a  leak  factor  inversion  for  fast  startup  at  the  beginning  of  each  splice  or  insert. 

Cropping  is  possible  by  manipulation  or  replacement  of  compressed  slices. 

image  processing  for  special  effects  is  best  performed  in  the  pixel  domain  after  decoding. 
Square  pixels  and  progressive  scanning  simplify  the  implementation  of  spedal  effects. 


ill.  Extensibility 

A.  to  no  visible  artifacts 

It  was  the  reported  intention  of  this  proponent  to  have  no  visible  artifacts  at  the  terrestrial 
broadcast  data  rate,  when  viewed  from  a  distance  of  3  times  the  picture  height.  This  goal  is,  however, 
challenged  by  complexity  and  motion.  The  proponent  suggests  a  rate  of  41  Mbps  for  no  visible  artifacts 
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regardless  of  detail  and  motion,  and  states  that  this  can  be  accomplished  with  a  small  change  to  the 
compressed  video  interface. 

B.  to  studio-Qualitv  data  rate 

Claims  are  made  that  the  compression  techniques  used  for  the  broadcast  of  DSC-HDTV  are 
easily  simplified  to  produce  a  200  Mbps  signal  for  use  in  the  studio.  This  signal  uses  only  intraframe 
processing,  and  thus  Is  suitable  for  all  editing  and  special  effects  processing.  The  claim  is  made  that 
the  quality  may  be  suitable  for  multiple  decoding/encoding  as  required.  The  bit  rate  is  suitable  for  serial 
data  interfaces  and  also  for  video  tape  recording  on  D-1  VTRs. 

C.  to  higher  resolution 

If  it  is  desirable  in  the  future  to  maintain  higher  pixel  numbers  in  the  production  studio,  this  can 
also  be  accommodated  in  an  extension  of  the  studio  compression  format,  by  compressing  the  higher- 
resolution  signal  rate  into  the  200  Mbps  studio  compression-standard  plus  a  high-frequency  residual 
signal.  The  standard  DSC-HDTV  system  codes  the  HD  frames,  and  a  simple  augmentation  encoder 
codes  the  residual  signal.  The  final  output  of  editing  or  special  effects  can  still  be  recorded  using  the 
200  Mbps  portion  of  the  compressed  signal. 

D.  Provision  for  future  compression  enhancement 

The  compression  algorithm  permits  improvements  in  the  selection  of  vector  quantization 
patterns,  motion  estimation,  perceptual  error  threshold  computation,  buffer  control,  leak  adaptation,  and 
transmission  prioritization.  These  Improvements  can  be  made  without  change  to  the  receivers  or  the 
transmitted  data  rate. 
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System  Description 

The  Narrow-MUSE  system  is  an  analog  transmission  simulcast  system.  The  input  signal  format 
and  the  display  signal  format  is  1125  scanning  lines  per  frame,  60.00  fields  per  second,  and  2:1 
interlace.  The  transmission  format,  however,  is  750  scan  lines  per  frame,  60.00  fields  per  second,  and 
2:1  interlace.  The  video  encoding  method  is  Multiple  Sub-Nyquist  Sampling  Encoding,  and  the  audio 
encoding  method  is  near-instantaneous  DPCM.  Transmission  is  by  analog  amplitude  modulation. 

i.  Interoperability 

A.  with  Cable  TV 

The  tests  of  the  proponent  systems  at  ATTC  by  Cable  Labs  should  determine  the  critical 
aspects  of  interoperability  with  cable  TV.  Other  factors  that  may  aid  cable  TV  Interoperability  include 
channel  augmentation  to  9  or  12  Mhz  for  Improved  service.  MUSE-E  of  8.1  Mhz  may  satisfy  this 
criterion.  An  even  more  desirable  feature  for  a  system  to  offer  would  be  a  higher-quality  mode  within 
the  6-Mhz  channel.  Digital  systems  can  do  this  by  sending  more  bits  per  symbol,  for  use  over  quiet 
channels  such  as  cable.  Since  the  Narrow-MUSE  system  is  an  analog  system,  it  cannot  be  enhanced 
in  this  manner. 

Encryption  and  addressing  are  other  important  service  features  for  cable  TV.  See  Section  II, 
Hems  F  and  G. 

B.  with  digital  technology 

Since  the  transmitted  signal  is  analog,  H  must  be  digitized  before  interfacing  with  digital 
technology.  However,  all  of  the  signal  processing  in  the  encoder,  modulator,  demodulator  and  decoder 
is  done  in  the  digital  domain,  and  custom  LSI's  are  available.  A  digital  Interface  port  is  provided  in  the 
receiver. 

C.  Headers/descriptors 

This  system  provides  128  kbps  of  ancillary  data.  Although  the  partition  of  the  ancillary  data  has 
not  yet  been  specified,  the  proponent  states  that  the  headers/descriptors  as  a  part  of  SMPTE  240M 
and  260M  could  be  assigned  into  the  ancillary  data  channel  of  Narrow-MUSE. 

D.  with  NTSC 

There  are  two  conversion  methods  to  NTSC.  One  is  from  the  750-line  Narrow-MUSE 

transmission  format  to  NTSC,  and  the  other  is  from  1125/60  to  NTSC.  Since  the  aspect  ratio  of  this 
system,  16:9,  is  different  from  the  aspect  ratio  of  NTSC,  45,  a  choice  must  be  made  for  the  conversion. 
The  limiting  choices  for  downconversion  are  1)  "Edge  Crop",  in  which  the  HDTV  picture  fills  483  lines  on 
NTSC  with  loss  of  the  sides  of  the  HDTV  picture,  and  2)  "Letterbox",  in  which  the  full  width  of  the  HDTV 
picture  is  displayed  in  the  full  width  of  the  NTSC  picture,  but  leaving  unused  areas  at  the  top  and  the 
bottom  of  the  NTSC  screen. 

The  conversion  from  Narrow-MUSE  to  NTSC  requires  only  vertical  interpolation  because 
Narrow-MUSE  employs  an  analog  transmission  technique.  Since  the  number  of  active  lines  in  Narrow- 
MUSE  is  651,  the  vertical  interpolation  ratio  is  31 53  for  "Edge-Crop"  and  is  9:5  for  the  letterbox  format. 
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The  interpolation  ratio  for  1125  to  NTSC  is  15:7  vertically  and  2:1  horizontally.  In  both  conversions, 
field-rate  conversion  from  60.00  Hz  to  59.94  Hz  is  required.  The  proponent  claims  that  a  motion- 
adaptive  field-rate  converter  is  available,  and  has  been  publicly  demonstrated.  The  proponent  also 
claims  that  the  converter  for  1125/60  to  NTSC  is  used  for  the  daily  simulcast  operation  In  Japan,  that 
the  converter  from  full-band  MUSE  to  home  display  is  sold  on  the  market,  and  that  the  same  technique 
can  be  applied  to  Narrow-MUSE. 

E.  with  film 

This  system  does  not  have  a  film  mode  within  its  encoding  algorithm.  Since  the  field  rate  of  this 
system  is  60  Hz,  the  temporal  conversiort  from  film  to  HDTV  is  accurate.  A  motion-compensated 
continuous-film-transfer  teledne  is  already  available  for  this  system. 

F.  with  computers 

For  computers,  a  progressive  display  is  overwhelmingly  preferred.  Interlaced  systems  such  as 
this  have  an  inherent  compatibility  problem  with  computers,  even  for  just  displaying  the  computer's 
video  output.  Progressive  scanning  is  preferable  for  computer  applications  to  avoid  artifacts  that  are 
common  with  interlaced  displays  of  computer-generated  imagery.  Also,  scan  conversion  between 
Interlaced  and  progressive  systems  can  produce  undesirable  artifacts.  Square  pixels  are  also 
overwhelmingly  preferred  for  graphics  display  hardware  for  all  computers  employing  bit-mapped 
displays,  as  It  facilitates  processing  of  2D  transformations,  especially  rotations.  Non-square  pixels  do 
not  present  a  problem  for  the  display  of  RGB  signals  from  a  computer's  video  card  on  HDTV  receivers, 
but  would  complicate  more  sophisticated  attempts  at  Interoperability,  such  as  the  display  by  an  HDTV 
receiver  of  a  picture  from  a  digital  data  stream  generated  by  a  computer.  According  to  the  proponent, 
the  pixel  shape  of  240M  is  1  : 1.043  (H  :  V),  and  1125/60  signals  have  already  been  manipulated  for 
graphic  purposes.  The  shape  of  a  decoded  Narrow-MUSE  pixel  is  1  : 0.56.  The  proponent  claims  that 
the  field  rate  of  60.00  Hz  is  a  better  selection  than  59.94  Hz  for  interoperability  with  computers  that 
have  integer  field  rates  such  as  72  Hz. 

G.  with  satellites 

This  system  can  be  transmitted  through  a  satellite  using  FM  with  an  RF  channel  bandwidth  of 
approximately  15  MHz.  FM  transmission  of  MUSE  through  a  satellite  is  a  proven  technology.  The 
proponent  claims  that  Narrow-MUSE  also  can  be  transmitted  through  a  satellite  using  digital 
transmission.  The  Narrow-MUSE  signal  can  be  encoded  by  DPCM  to  a  data  rate  of  approximately  40 
Mbps  which  includes  error  correction  of  8%  (3.2  Mbps).  Satellite  links  typically  use  more  error 
correction  than  this,  e.g.  14%  to  50%.  The  RF  channel  bandwidth  with  QPSK  is  approximately  24  MHz. 
Digital  transmission  of  MUSE  In  conjunction  with  DPCM  is  also  a  proven  technology. 

H.  with  packet  networks 

This  item  is  not  applicable  because  this  system  employs  analog  transmission. 

I.  with  interactive  systems 

The  acquisition  time  is  0.5  seconds. 


27 


DRAFT 


Narrow-MUSE 


Latency  is  the  time  delay  between  a  video  frame  going  into  the  encoder  and  the  corresponding 
frame  coming  out  of  the  decoder  In  the  back-to-back  mode.  It  can  be  important  in  interactive 
applications.  Frame  delays  are  required  at  the  transmitter  and  at  the  receivers  for  coding  and 
decoding.  Because  the  digital  systems  have  other  delays,  comparison  of  latency  between  the 
proponent  systems  is  difficult  at  this  time. 

According  to  the  proponent,  the  total  delay  for  Narrow-MUSE  through  an  encoder  and  a  decoder 
is  6  fields  (approximately  100  msec),  3  fields  for  each. 

J.  Format  conversion 

1.  with  1125/60 

No  format  conversion  is  required  because  this  system  uses  1125/60  format  (SMPTE  240M). 
The  decoded  Narrow-MUSE  signal  can  be  converted  to  the  Common  Image  Format  through  a  line- 
number  conversion  and  a  sampling-frequency  conversion.  These  are  24:23  vertically  and  50:27 
horizontally. 


2.  with  1250/50 

Conversion  between  systems  with  different  frame  rates  is  the  most  difficult  type  of  conversion 
presently  being  done.  However,  this  system  makes  the  frame-rate  conversion  slightly  easier  than  other 
systems,  because  its  field  rate  is  exactly  60.00  Hz.  The  proponent  claims  that  the  conversion  from  60 
Hz  to  50  Hz  has  been  successfully  demonstrated  with  a  standard  converter  from  1 125/60  to  PAL.  This 
standard  converter  employed  Interpolation  based  on  motion  vectors.  The  vertical  Interpolation  ratio 
from  1125  to  1250  is  9:10. 

3.  with  CCIR  601/60 

The  vertical  interpolation  ratio  between  this  system  and  CCIR  601/60  is  15:7  for  both  total 
scanning  lines  (1125:525)  and  active  scanning  lines  (1035:483).  The  horizontal  conversion  ratio  is  2:1. 
The  clock  frequency  for  this  system,  74.25  MHz,  and  for  CCIR  601,  13.5  MHz,  have  a  relationship  of 
11:2. 

Because  the  field  frequency  for  this  system  is  60.00  Hz,  field-rate  conversion  from  60.00  Hz  to 
59.94  Hz  is  required.  A  motion-adaptive  field-rate  converter  is  available  and  is  used  daily  for  simulcast 
in  Japan.  A  frame  skip  must  take  place  every  33  seconds,  because  of  the  1001/1000  frame 
conversion.  The  hardware  attempts  to  do  this  cut  on  a  motionless  picture  or  on  a  scene  change. 

4.  with  CCIR  601/50 

The  vertical  conversion  ratio  between  Narrow-MUSE  and  CCIR  601/50  is  9:5  for  both  total 
scanning  lines  (1125:625)  and  active  scanning  lines  (1035:575).  The  horizontal  ratio  is  2:1.  The  clock 
frequency  for  Narrow-MUSE,  74.25  MHz  and  for  CCIR  601,  13.5  MHz,  have  a  relationship  of  11:2. 
Also,  see  Hern  2  above. 

5.  with  MPEG 

Narrow-MUSE  does  not  have  interoperability  with  MPEG. 
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6.  with  still  image 

An  1125/60  still  image  disk  system  based  on  the  JPEG  algorithm  has  been  developed  and 
demonstrated.  The  system  uses  multiple  disks,  with  video,  audio,  and  control  data  recorded  on 
separate  disks.  Narrow-MUSE  does  not  have  compatibility  with  JPEG,  Photo  CD,  or  CDI. 

K.  Scalability 

This  system  uses  a  multiple  sub-sampling  technique  with  a  four-field  sequence.  Therefore,  the  ' 
spatial  resolution  of  the  reconstructed  picture  can  be  controlled  by  selecting  fields  to  be  used  for  the 
interpolation.  When  all  four  fields  are  used,  a  full-quality  picture  Is  obtained.  When  one  of  every  four 
fields  is  used,  a  picture  with  reduced  resolution  can  be  obtained  by  interpolation.  Also,  a  picture  with 
reduced  size  can  be  obtained  by  using  only  a  selected  field. 

The  proponent  claims  that  Narrow-MUSE  is  a  member  of  the  MUSE  family,  which  is  based  on 
the  concept  of  scalability.  The  MUSE  family  consists  of  MUSE-T,  MUSE-E  (full-band  MUSE).  Narrow- 
MUSE.  and  NTSC  MUSE-4,  all  based  on  the  same  coding  algorithm.  /Ul  these  systems  have  been 
demonstrated. 

For  display  on  a  computer,  pictures  reduced  by  1/2^  can  be  made  wth  only  intra-field 
information.  Other  ratios  require  more  processing. 

This  system  uses  a  multiple  sub-sampling  technique  with  a  four-field  sequence.  Therefore,  the 
temporal  resolution  of  the  reconstructed  picture  can  be  controlled  by  selecting  fields  to  be  used  for  the 
interpolation.  When  all  four  fields  are  used,  full  temporal  resolution,  1/60  sec,  is  obtained.  To  reduce 
the  amount  of  data,  the  field  repetition  rate  can  be  reduced  for  pictures  with  less  temporal  resolution. 

The  multiple  sub-sampling  technique  makes  possible  two  types  of  receivers  differing  in 
complexity.  A  simple  receiver  can  be  built  that  handles  only  intra-field  Interpolation,  while  the  full- 
capability  receiver  handles  both  intra-field  and  Inter-field  interpolation. 

T^e  low-frequency  component  below  2  MHz  of  the  Narrow-MUSE  signal  does  not  contain  the 
aliasing  component  caused  by  frame  offset  sub-sampling.  Therefore,  a  picture  whose  quality  is 
equivalent  to  NTSC  can  be  reproduced  by  using  only  this  low-frequency  component. 

Picture-in-pIcture,  pIcture-out-of-pIcture.  and  multiple  programs  can  be  accommodated  using 
only  the  intra-field  information  from  the  Narrow-MUSE  signal.  A  frame  store  in  the  receiver  can  be  used 
for  this  purpose. 

il.  Scope  of  services  and  features 

A.  initial  use  for  ancillary  data 

The  partition  of  the  ancillary  data  has  not  been  specified  yet. 

B.  Audio 

This  system  provides  two  audio  modes.  Both  modes  employs  Near-instantaneous  Companding 
DPCM  for  coding.  Mode  A  can  transmit  four  audio  channels  with  15-kHz  bandwidth,  the  data  rate  of 
which  is  1056  kbps/channel  without  error  protection.  Mode  B  can  transmit  two  audio  channels  with  20- 
kHz  bandwidth,  the  data  rate  of  which  is  1072  kbps/channel  without  error  protection.  Since  the  data 
capacity  of  the  audio  channel  is  relatively  high,  modifications  can  be  made  to  increase  the  number  of 
audio  channels. 
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C.  Data 

This  system  provides  128  kbps  of  ancillary  data.  This  can  be  increased  by  modifying  the  audio 
encoding  method.  The  interface  for  the  data  channel  is  RS-422. 

D.  Text 

Teletext  data  is  transmitted  using  the  ancillary  data  channel.  The  proponent  suggests  a  data 
rate  of  1 6  kbps.  He  also  suggests  that  the  number  of  characters  per  picture  is  between  1 000  and  4000,  • 

to  be  transmitted  in  about  3  seconds.  These  numbers  can  vary,  based  on  the  service  required. 

E.  Captionino 

Captioning  data  is  transmitted  using  the  ancillary  data  channel.  The  proponent  suggests  a  data 
rate  of  0.6  kbps.  This  number  might  vary,  based  on  the  service  required. 

F.  Encryption 

The  system  submitted  for  testing  did  not  include  encryption.  The  proponent  suggests  a 
combination  of  line  rotation  and  line  permutation  for  signal  security.  Decoder  chips  for  the  encryption 
system  are  already  developed. 

G.  Addressing 

The  proponent  claims  that  the  data  rate  needed  for  addressing  and  sending  decryption  keys  is 
approximately  16  kbps,  and  that  this  rate  will  permit  the  addressing  of  10,000,000  subscribers  in 
approximately  two  days.  However,  this  addressing  rate  is  not  sufficient  for  a  pay-per-view  environment. 
The  addressing  information  is  transmitted  through  the  ancillary  data  channel. 

H.  Low-cost  receiver 

This  system  has  two  modes  in  the  receiver,  i.e.  stationary  mode  and  motion  mode.  The 
stationary  mode  requires  more  memory  and  a  more  complicated  interpolation  than  the  motion  mode. 

By  using  only  the  motion  mode,  low-cost  receivers  can  be  built  wth  reduced  resolution.  The  quality  is 
higher  than  with  NTSC  on  a  2"  LCD  receiver  because  the  resolution  is  almost  the  same,  and  neither 
cross-luminance  nor  cross-color  is  observed.  The  proponent  claims  that  a  reduced-cost  receiver  is 
already  available  on  the  market  for  the  full-band  MUSE,  and  that  the  same  configuration  can  be  used 
for  Narrow-MUSE. 

i.  VCR  capability 

The  proponent  claims  that  a  digitized  Narrow-MUSE  signal  with  an  80-Mbps  data  rate  or  a 
DPCM-encoded  Narrow-MUSE  signal  with  a  40-Mbps  data  rate  can  be  digitally  recorded  on  a  1/2  inch 
cassette  VCR.  The  record/playback  time  is  said  to  be  4  to  8  hours.  Since  the  bandwidth  of  Narrow- 
MUSE  is  the  same  as  for  NTSC,  the  requirements  of  a  VCR  are  similar. 

Only  sync  blocks  whose  ID  sign^s  are  detected  correctly  are  used  for  fast  forward  and  reverse. 
Sync  blocte  whose  ID  signals  are  not  detected  correctly  are  discarded,  and  replaced  with  interpolated 
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information.  The  proponent  claims  that  the  quality  of  a  rapid  search  picture  will  be  comparable  to  that 
of  a  4-head  VHS  machine. 

These  functions  can  be  achieved,  based  on  the  four-field  sequence  of  the  Neirrow-MUSE 
algorithm.  Editing  functions  can  be  implemented  by  adjusting  the  subsampling  phases  between  the 
materials  to  be  edited,  using  the  subsampling  phase  information  which  is  transmitted  as  a  part  of  the 
control  signal.  The  four-field  sequence  is  similar  to  that  of  NTSC. 

Special  effects  are  not  done  with  the  Narrow-MUSE  signal.  Rather,  they  are  done  on  the 
1125/60  signal. 

iii.  Extensibility 

A.  to  data  rate  with  no  visible  artifacts 

MUSE-T,  a  higher  member  of  the  MUSE  family,  has  a  bandwidth  of  16.2  MHz  and  can  provide  a 
picture  with  no  visible  artifacts  because  it  employs  only  intra-field  subsampling.  A  digitized  MUSE-T  can 
be  further  compressed  using  DPCM  without  Introducing  additional  artifacts.  The  main  part  of  a  Narrow- 
MUSE  receiver  can  be  shared  for  MUSE-T  decoding  when  MUSE-T  is  transmitted  through  alternate 
media  such  as  DBS.  It  is  also  possible  to  extend  Narrow-MUSE  to  MUSE-T  by  transmitting  the 
difference  between  locally  decoded  Narrow-MUSE  and  MUSE-T  through  an  additional  channel. 

B.  to  studio  Quality  data 

It  is  possible  to  extend  Narrow-MUSE  to  240M  by  transmitting  the  difference  between  the  locally 
decoded  Narrow-MUSE  and  240M  signals  through  an  additional  channel  as  augmentation  Information. 
The  bandwidth  of  the  studio-quality  signal  is  60  MHz  (30  MHz  for  luminance  signal  and  15  MHz  for  each 
color  difference  signal). 

C.  higher  resolution 

MUSE-E  can  be  used  for  higher  resolution.  The  main  part  of  a  Narrow-MUSE  receiver  can  be 
shared  for  MUSE-E  when  MUSE-E  is  transmitted  through  alternate  media  such  as  DBS.  It  is  also 
possible  to  extend  Narrow-MUSE  to  MUSE-E  by  transmitting  the  difference  between  locally  decoded 
Narrow-MUSE  and  MUSE-E  through  an  additional  channel. 

The  proponent  suggests  that  it  is  possible  to  extend  Narrow-MUSE  to  VHDTV  and  UHDTV  by 
transmitting  the  difference  between  the  locally  decoded  Narrow-MUSE  and  VHDTV/UHDTV  through  an 
additional  channel  as  an  augmentation  signal. 

D.  Provision  for  future  compression  enhancement 

The  proponent  claims  that  the  dynamic  resolution  can  be  improved  by  increasing  the  number  of 
motion  vectors.  The  additional  motion  vectors  can  be  transmitted  through  the  data  channel  at  the 
expense  of  data  for  other  purposes. 
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INTRODUCTION 


.  Following  interoperability  presentations  made  on  September  23-25,  1992  by  the 
advanced  television  system  proponents,  the  Review  Board  of  multi-industry  experts 
and  the  system  proponents  were  asked  to  evaluate  the  ATV  systems  with  regard  to 
interoperability,  extensibility,  and  scope  of  services  and  features  using  the  ATV 
Interoperability  Evaluation  Matrix  previously  defined.  An  analysis  of  the  evaluation 
matrices  submitted  in  response  to  this  request  follows  in  this  report. 

The  interoperability  assessment  was  a  qualitative  assessment  process.  The 
definitions  of  the  "ATV  Features"  in  the  evaluation  matrix  are  not  precise  and  while 
"generally  understood"  by  the  Review  Board,  differences  in  understanding  of  the 
features  exists. 

It  was  difficult  to  delineate  (1)  proponent  system  configuration  tested  (2) 
proponent  system  configuration  that  would  be  field  tested  (3)  proponent  system 
configuration  that  would  be  first  delivered  (4)  possible/realistic  future  configurations  for 
a  proponent  system.  As  much  as  possible,  the  evaluations  were  based  on  "what  the 
evaluator  expects  the  proponent  will  deliver  commercially". 

The  weighting  of  the  value  (e.g.  ,  market/social  benefit)  of  interoperability  or 
extensibility  other  than  in  established  markets  (e.g.  terrestrial  broadcast  or  cable 
television,  entertainment,  advertising  and  news)  is  difficult  and  arguable.  Formal 
positions  from  the  affected  industries  are  not  available. 

In  response  to  the  interoperability  investigation,  some  proponents  suggested 
alternative  configurations  from  that  tested  by  the  ATTC  for  improving  interoperability. 
Proponents  were  invited  to  submit  additional  information  on  interoperability  by  October 
1,  1992.  Two  proponents  submitted  such  information  on  that  date  and  this  was 
incorporated  into  the  draft  assessment  dated  October  19,  1992.  Since  the  evaluation 
matrices  were  due  on  October  5,  the  additional  proponent  information  may  not  have 
been  considered  in  some  of  the  evaluations.  For  example,  a  Review  Board  member 
indicated  that  his  rating  for  DigiCipher  packet  network  Interoperability  should  change 
from  "4"  to  "2"  with  the  implementation  of  the  video  packet  and  transwitch  format  layers 
described  in  the  October  1, 1992  DigiCipher  supplemental  information.  Review  Board 
members  were  Invited  to  submit  any  changes  they  wished  to  make  in  their  evaluations 
by  November  2, 1992.  The  revised  evaluations  are  included  in  Appendix  III. 


Proponent  Analysis  of  Other  Systems 


In  general,  each  proponent  evaluated  his  own  system  better  than  his  opponents 
(colleagues)  evaluated  his  system.  In  fact,  one  proponent  did  not  include  the 
evaluation  of  his  own  system  because  of  "possible  conflict  of  interest".  Matrix  I,  on  the 
next  page,  was  done  without  the  proponent's  evaluation  of  his  own  system  to  remove 
this  bias.  Only  two  proponents  evaluated  interoperability  with  cable  TV  and  hence 
this  was  not  included.  The  ratings  of  all  proponents  are  found  in  Appendix  I. 

The  mean  value  found  in  the  tables  is  the  estimate  of  the  mean  based  on  N 
independent  observed  values,  x,-,  of  the  random  variable  x. 


The  standard  deviation  in  the  tables  is  the  unbiased  estimate  given  below. 


Interoperability 

with  Digital  Technology 

The  all-digital  systems  were  rated  approximately  equal.  The  Narrow- 
MUSE  system  was  evaluated  as  difficult  to  interoperate  with  digital  technology. 
The  proponent  did  not  agree  with  this  assessment. 


with  Headers/Descrlptors 

AD-HDTV  was  rated  as  easiest  to  Implement  with  headers/descriptors 
and  Narrow-MUSE  the  hardest  with  no  disagreement  from  the  Narrow-MUSE 
proponent. 
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MATRIX  I  -  PROPONENT  ANALYSIS  OF  OTHER  SYSTEMS 


Mean  (Standard  Deviation) 


ATV  Proponent  System 


ATV  FEATURE 


with  digital  technology 


with  headers/descriptors 


with  NTSC 


with  lilm 


with  computers 


with  satellites 


with  packet  networks 


with  interactive  systems 


Format  conversion 


Scalability 


Scope  of  Services  and 
Features 


Initial  use  for  ancillary  data 


Audio 


Data 


Text 


Captioning 


Encryption 


Addressing 


Low-cost  receiver 
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4.75 

(0.50: 

5.00 

(0.00 

2.63 

(1.11 

3.75 

(0.96 

5.00 

(0.00 

2.50 

(0.58 

5.00 

(0.00 

2.00(1.15)  I  2.00(1.15 


1.50  (1.00 


1.50(1.00 


irasara 


2.25  (0.50 


2.25  (0.96 


2.38  (0.48 


3.00(1.42 


3.00  (1.42 


2.50  (0.58) 


2.50 

(1.00) 

2.25 

(0.96) 

2.25 

(0.96) 

1.50 

(1.00) 

1.75 

(0.96) 

2.75 

(0.50 

2.88 

(0.85 

2.00 

(1.15 

1.00 

(0.00 

2.00 

(1.15 

1.75 

(0.96 

1.75 

(0.96 

1.25 

(0.50 

2.00 

(1.15 

2.00  (1.15 


2.00 

(1.15 

2.00 

(1.15 

2.00 

(1.15 

1.75 

(0.96 

2.50 

(1.00 

2.75 

(1.71 

1.75 

(0.96 

3.38 

(1 .25 

3.13 

(1.44 

3.00 

(1.41 

3.75 

(1.26 

3.50 

(0.58 

2.38 

00 

o 

2.50 

(0.58 

2.75 

(0.50 

3.50 

(0.58 

3.50 

(0.58 

2.75  (0.50) 


^Proponent  rating  of  own  system  not  included. 


3.25  (0.96 


3.25  (0.96 


2.75  (0.50) 


2.50  (0.58 


3.25  (0.96 


3.25  (0.96 


2.75  (0.50) 


4.00  (0.82 


3.63  (1.11 


4.25(1.50 


4.63  (0.75 


4.63  (0.75 


4.25  (0.96) 


1 

-  Easv  to  implement 

3 

•  Moderately  difficult 

5 

•  Very  difficult  to  implement 

2 


with  NTSC 


All  systems  were  evaluated  as  relatively  easy  to  interoperate  with  NTSC. 
DigiCipher  received  a  slight  edge  here. 

with  Film 

DigiCipher  and  CC-DigiCipher  received  a  slight  edge  in  interoperability 
with  film. 

with  Computers 

DSC-HDTV  and  CC-DigiCipher  (progressive  scan/square  pixels)  received 
a  slight  edge  in  interoperability  with  computers. 

with  Satellites 

All  systems  were  judged  as  relatively  easy  to  Interoperate  with  satellites 
with  DigiCipher  and  CC-DigiCipher  having  a  slight  edge. 

with  Packet  Networks 

AD-HDTV  was  judged  significantly  easier  (>2a)  to  Interoperate  with 
packet  networks. 

with  Interactive  Systems 

The  all-digital  systems  were  judged  in  the  same  range  of  difficulty  in 
interoperability  with  interactive  systems,  however,  CCDC  had  a  slight  edge. 

Format  Conversion 

The  progressive  scan  systems  achieved  a  slight  edge  in  ease  of  format 
conversion  with  CCDC  having  a  slight  edge. 

SC9l9bi|{ty 

AD-HDTV  and  DSC-HDTV  received  a  slight  edge  in  scalability. 

Scope  of  Service  and  Features 


initial  use  for  ancillary  data 
No  significant  difference. 
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Audio 


No  significant  difference. 


Bata 

Text 

Captioning 

AD-HDTV  had  a  slight  edge  in  data  services  and  features  but  the  other 
systems  were  not  far  behind. 

Engryptign 

DigiCipher  had  a  slight  edge  in  encryption. 

Addressing 

No  significant  difference  in  addressing  capability  was  shown  among  all- 
digital  systems  although  AD-HDTV  and  CC-DigiCipher  received  a  slight  edge. 

Low-cost  receiver 

No  significant  difference  but  DSC-HDTV  had  a  very  slight  edge. 

VCR  pgpgpility 

AD-HDTV  was  given  an  edge  in  VCR  capability. 


Extensibility 

to  no  visible  artifacts 
to  studio-quality 

No  significant  difference  among  all-digital  systems. 

lo-hlghpr . rgsgiution 

to  VHDTV.  UHPTV 

No  significant  difference  among  all-digital  systems  with  AD-HDTV  having 
only  the  slightest  edge. 
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Provision  for  Future  Compression  Enhancement 


No  significant  difference  among  all-digital  systems  with  AD-HDTV  having 
only  the  slightest  edge. 


Summary  Evaluation 
InteroDerabiiitv 

No  one  system  is  best  for  interoperability  with  all  media.  In  order  to  arrive  at 
some  conclusion,  assume  that  all  features  under  Interoperability  are  weighted 
uniformly.  Then  the  following  Interoperability  table  is  computed  with  mean  (standard 
deviation). 


1  AD-HDTV 

CCDC 

DSC-HDTV 

Narrow-MUSE 

Only  very  slight  differences  are  noted  for  the  all-digital  systems.  AD-HDTV  edged  out 
CCDC,  which  edged  out  the  other  two  all-digital  systems. 

Scone  of  Services  and  Features 

It  is  not  advisable  to  weight  all  features  equally.  Audio,  VCR  capability, 
addressing  and  low-cost  receiver'are  arguably  more  important.  Different  systems  lead 
the  rankings  in  each  category.  Therefore,  no  conclusion  can  be  reached  for  this 
category. 

Extensibility 

Assuming  that  all  features  under  Extensibility  are  weighted  uniformly,  the 
following  Extensibility  table  Is  computed  with  mean  (standard  deviation). 


1  AD-HDTV 

CCDC 

DSC-HDTV 

Narrow-MUSE 

wisasiwaM 

2.90(0.66) 

2.75(0.90) 

2.71(0.86) 

Only  slight  differences  are  noted  for  the  all-digital  systems.  AD-HDTV  edged  out  DSC- 
HDTV,  which  edged  out  DigiCipher.  AD-HDTV  ranked  best  or  tied  for  best  for  all 
extensibility  features  and  would,  therefore,  be  ranked  best  or  tied  for  best  regardless  of 
the  weighting  of  the  features. 
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II.  Evaluation  bv  the  Review  Board 


A  tabulation  of  the  means  and  standard  deviations  from  the  evaluations  by  the 
Review  Board  is  shown  as  Matrix  II.  The  evaluations  by  all  Review  Board  members  are 
given  in  Appendix  II.  In  many  cases  the  Information  was  incomplete  (n)  or  verification 
(v)  was  needed.  The  number  of  times  that  "n"  and  "v"  appeared  in  the  review  board 
ratings  is  noted  by  the  number  inside  the  parentheses  In  the  matrix  that  follows. 

Interoperability 

All  five  systems  were  judged  to  be  easily  deliverable  by  satellite  or  Cable  TV. 
AD-HDTV  was  rated  most  interoperable  with  headers/descriptors  and  packet  networks. 
The  four  all-digital  systems  were  judged  to  interoperate  well  with  film  with  CCDC 
ranking  better  than  the  other  three  systems.  Rankings  of  other  characteristics  of 
Interoperability  were  close.  DSC-HDTV  was  most  interoperable  with  computers  with 
CCDC  close,  and  best  for  format  conversion  with  AD-HDTV  close.  DSC-HDTV  was 
judged  most  interoperable  with  digital  technology  and  interactive  systems,  with  other 
all-digital  systems  very  close.  CCDC  was  ranked  most  interoperable  with  NTSC,  with 
DSC-HDTV  very  close.  AD-HDTV  was  ranked  most  scalable  with  DSC-HDTV  very 
close.  DigiCipher  was  ranked  slightly  more  interoperable  with  cable  TV,  with  all  other 
systems  very  close.  If  all  features  are  weighted  uniformly,  the  overall  mean  for  AD- 
HDTV  Is  1.88,  DSC-HDTV  Is  1.90,  CCDC  is  2.13,  DigiCipher  is  2.43  and  Narrow- 
MUSE  is  3.61. 

Scope  of  Features  and  Services 

The  spread  between  evaluations  of  the  various  systems  was  small.  However, 
AD-HDTV  ranked  best  In  initial  use  for  ancillary  data,  data,  encryption,  addressing,  low- 
cost  receiver  and  VCR  capability.  DSC-HDTV  ranked  best  In  text  and  captioning. 
CCDC  ranked  best  in  audio.  Unless  substantially  higher  weighting  is  placed  on  the  last 
three  features,  AD-HDTV  would  be  ranked  best  for  scope  of  services  and  features. 

Extgnsibiilty 

No  significant  differences  are  noticed  among  the  mean  values  of  the  all-digital 
systems.  If  it  is  assumed  that  all  features  under  Extensibility  are  weighted  uniformly, 
DSC-HDTV  has  the  lowest  mean  (best  score)  with  2.43,  AD-HDTV  and  CCDC  are  tied 
with  a  mean  of  2.53,  DigiCipher  has  a  mean  of  3.08  and  Narrow-MUSE  of  4.50. 
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ATV  FEATURE 


InteroDerabili 


with  cable  TV 


with  diaital  techno 


with  headers/descnptors 


with  NTSC  _ 


with  film  _ 


with  computers  _ 


with  satellites  _ _ 


with  packet  networks 


with  interactive  systems 


Format  conversion 


Scalabilit 


Scope  of  Services  and 
Features  _ _ 


Initial  use  for  ancillary  data 


Audio _ 


Data  '  _ 


Text  _ 


Captionin 


Enc 


Address! 


Low-cost  receiver 


VCR  capabilit 


Extensibility _ 


to  no  visible  artifacts _ 


to  studio-cualitv  data  rate 


to  higher  resolution 


to  VHDTV _ 


to  UHDTV 


Provision  for  future 
compression  enhancement 


Matrix  II  -  REVIEW  BOARD  Evaluation 

Interoperability  Evaluation  Matrix  -  Revision, 
mean  (standard  deviation) 


1.22(0.44 


1.67(0.84 


1.36  (0.50 


1.91  (0.94 


1.55  (0.69 


2.49  (0.92 


1.18(0.40 


1.27(0.47 


2.34(1.05 


2.45  (1.13 


3.18(1.25 


1.25(0.46 


1.40(0.57 


2.73  (0.79 


1.45(0.69 


1.18(0.40 


2.04  (0.78 


1.27(0.47 


3.20  (0.42 


2.38  (0.86 


2.64  (1.21 


3.84  (0.98 


1.13(0.35 


1.61  (1.09 


2.90  (0.99 


1.80(0.92 


1.60(0.84 


3.51  (0.84 


1.20  (0.42 


3.20  (0.79 


2.57(1.34 


3.00  (0.94 


4.20  (0.63 


DSC  Narrow-  I  Footnotes 
Muse  I 


■ 

■ 

■ 

■ 

■ 


1.25  (0.46 


2.00 

(0.94; 

1.60 

(0.84; 

1.60 

(0.84; 

1.83 

(0.65; 

1.20 

(0.42 

2.30 

(0.67 

2.26 

(0.83 

2.20 

(0.79 

3.30 

(1.06 

1.89 

(1.45; 

3.97 

(i.oo; 

4.50 

(0.74; 

3.05 

(1.15 

3.59 

(1.16 

1.86  (1.00 


3.57(1.17 


4.09  (1.04 


3.80  (1.23 


1.67 

(I.oo; 

1.20 

(0.42; 

1.36 

(0.67 

1.50 

(I.27; 

1.55 

(1.21 

1.36 

(0.67 

1.18 

(0.40 

2.91 

(1.04 

2.18 

(0.60 

1.78(0.97 


1.09  (0.30 


1.64(0.81 


2.00 

(0.94 

2.00 

(0.94 

2.80 

{\23 

3.00 

(1.05 

3.00 

(1.05 

II 

■I 

■I 

■I 


1.36(0.50 


1.45(0.82 


1.68  (1.19 


3.30  (0.95 


1.75 

(1.04; 

1.20 

(O.42; 

1.70 

(1.06; 

1.56 

(1.33; 

1.60 

(1.26 

1.40 

(0.84 

1.30 

(0.95 

3.40 

(0.70 

2.70 

(0.48 

1.75(1.04 


1.20(0.42 


1.50  (0.85 


1.22  (0.44 


1.30  (0.48 


1.40  (0.70 


1.40(0.70 


3.00  (0.94 


2.60  (0.52 


1.82  (1.47 


2.27  (1.56 


2.00  (1.41 


2.91  (1.22 


2.40  (0.97) 


1.70  (0.67 


2.00  (0.67 


3.00  (0.94 


3.00  (0.94 


3.00  (0.94 


2.50  (1.08) 


II 

■I 

II 

II 


2.33(1.00 


2.67(1.12 


3.56  (0.88 


3.56(0.88 


2.78(1.30) 


1.89(0.60 


1.89(0.60 


2.78(1.09 


2.78(1.09 


2.78(1.09 


2.44(1.01) 


4.05  (0.90 


4.50  (0.85 


4.80  (0.63 


4.80  (0.63 


4.65  (0.67) 


jjgluatlof^atlng 

1 

2 

3 

4 

5  _ 


yoMd  ImpiementatlOQ 
■  Easy  to  Imptement _ 

>  Modwiflv  difficult  _  Z 

.  Very  difficult  to  implement 
No  aniwer  of  Incomplete 
Voriticallon  needed _ 


MATRIX  H  -  REVIEW  BOARD  Evaluation  (con’t) 
ATV  Interoperability  Evaluation  Matrix  -  Revision  A 


ATV  Proponent  System 


ATV  FEATURE 

Advanced 

CCDC 

DifliCipher 

DSC 

Narrow- 

Diaital 

Muse 

Interoperability 

with  cable  TV 

with  digital  technology 

vM) 

v(1)n(1) 

v(1)n(1) 

v(1)n(1) 

v(1)n(1) 

with  headers/descriptors 

V(1) 

v(1) 

v(1) 

v(1) 

n(1) 

with  NTSC 

V(1) 

v(1)n(1) 

v(1)n(1) 

v(1)n(1) 

v(1Jn(1) 

with  film 

v(2) 

v(1) 

v(1) 

v(1) 

with  computers 

v(2) 

y{2)  n(1)  . 

vdind) 

v(2)  n(1) 

v(1) 

with  satellites 

with  packet  networks 

v(2)  n(1) 

Vf1)n(3) 

n(1) 

v(1)n(1) 

n(3J  _  . 

with  interactive  systems 

v(2) 

v(1)n(2) 

n(1) 

v(2) 

Format  conversion 

v(1) 

v(2) 

v(1) 

v(2) 

v(1) 

Scalability 

v(1) 

v(2) 

v(1) 

v(2) 

W1) 

Scope  of  Services  and 

Features 

initial  use  for  ancillary  data 

v(1)n(1) 

v(1)n(1) 

v(1)n(1) 

v(1)n(1) 

n(1)  . 

Audio 

Data 

v(1) 

v(1) 

V(1) 

v(1) 

Text 

v(1) 

v{1) 

Captioning 

v(1)  _ 

v{1J 

Encryption 

v(2) 

v(2) 

v(2) 

v(2) 

Addressing 

v(2) 

_y(2). . . 

v(2) 

v(2) 

Low-cost  receiver 

...  Vi2)  ... 

v(2)  n(1) 

v(2) 

v(2) 

v(1)  .. 

VCR  capability 

v{2) 

v(2) 

v(1) 

v(2) 

W1)n(1) 

Extensibility 

to  no  visible  artifacts 

v(2)  n(2) 

v(2)  n(2) 

v(2)  n(2) 

v(2)  n(2) 

v(1)n(2) 

to  studio-guality  data  rate 

v(2)  n(2) 

v(2)  n(2) 

v(2)  n(2) 

v(2)  n(2) 

v(1)n(2) 

to  higher  resolution 

v(2)  n(2) 

vf2)  n(2) 

v(2)  n(2) 

v(2)  n(2) 

v(1)n(2i 

to  VHDTV 

v(2)  n(2) 

v(2)  n(2) 

v(2)  n(2) 

v(2)  nl2} 

v(1)n(2) 

to  UHDTV 

v(2)  n{2) 

v(2)  n(2) 

v(2)  n(2) 

v(2)  n(2) 

v{1)n(2).._ 

Provision  for  future 
compression  enhancement 

v(2)  n(2) 

v(2)n(2) 

v(1)n(2) 

v(2)  n(2) 

v(1)n(2) 

ATV  Interoperability  Kev; 


Evaluation  Ratlno 

ProDosed  ImDlementatlon 

1 

-  Easy  to  implement 

2 

3 

•  Moderately  difficult 

4 

5 

-  Very  difficult  to  implement 

n 

No  answer  or  incomplete 

V 

Verification  needed 
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III.  Experts  Interoperability  Evaluation 


The  previous  analyses  involved  averaging  a  number  of  evaluations.  In  the 
meeting  on  Friday,  September  25,  several  experts  from  the  Review  Board  led  the 
discussion  of  interoperability  issues  for  evaluating  the  advanced  television  systems. 
Each  discussed  features  from  the  evaluation  matrix  important  to  the  area  of  technology 
in  which  he  is  an  expert.  The  following  Experts'  Evaluation  Matrix  consists  of  the 
evaluations  by  each  expert  of  the  features  that  he  discussed. 

Interoperability 

All  systems  were  judged  interoperable  with  cable  TV.  The  progressive-scan 
square-pixel  systems  were  judged  most  interoperable  with  digital  technology.  AD- 
HDTV  was  judged  most  Interoperable  with  headers/descriptors.  The  progressive  scan 
systems  were  judged  most  interoperable  with  NTSC.  AD-HDTV  and  DSC-HDTV  were 
judged  most  interoperable  with  film.  DSC-HDTV  was  judged  most  interoperable  with 
computers.  All  systems  were  judged  interoperable  with  satellites.  DSC-HDTV  was 
judged  most  interoperable  with  interactive  systems.  The  progressive  systems  were 
judged  best  for  format  conversion.  AD-HDTV,  DSC-HDTV  and  Narrow-MUSE  were 
ranked  best  for  scalability.  DSC-HDTV  was  ranked  best  or  tied  for  best  for  all  features 
except  headers/descriptors.  If  all  features  were  weighted  equally,  DSC-HDTV  would  be 
rated  the  most  interoperable. 

Scope  of  Features  and  Services 

DSC-HDTV  and  AD-HDTV  were  ranked  best  for  data  and  for  VCR  capability. 
DSC-HDTV  was  tied  with  Narrow-MUSE  for  low-cost  receiver  and  was  tied  with  CCDC 
for  text  interoperability.  DSC-HDTV  would,  therefore,  be  given  the  best  overall 
evaluation  for  scope  of  services  and  features. 

Extensibility 

AD-HDTV  was  rated  best  or  tied  for  best  in  all  categories  and  would,  therefore, 
be  most  extensible. 
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MATRIX  III-  EXPERTS*  EVALUATION 
ATV  Interoperability  Evaluation  Matrix  -  Revision  A 


ATV  Proponent  System 


ATV  FEATURE 

Advanced 

Diaiial 

CCDC 

DioiCIoher 

Narrow- 

Muse 

Interoperability 

with  cable  TV 

1  . 

1 

1 

1 

1 

with  digital  technology 

3 

2 

4 

2 

5 

with  headers/desCTiptors 

2 

3n 

3n 

3n 

5 

with  NTSC 

2 

1 

2 

1 

3 

with  film 

1 

2 

2 

1 

4 

with  computers 

2.42V 

2.42nv 

3.1nv 

2.28nv 

3.7v 

with  satellites 

2 

2 

2 

2 

2 

with  packet  networks 

2 

3 

4/2 

2 

n 

with  interactive  systems 

2.42V 

2.42nv 

3.1nv 

2.28nv 

3.7v 

Format  conversion 

5 

2v 

4 

2v 

5 

Scalability 

3 

4 

4 

3 

3 

Scope  of  Services  and 

Features 

Initial  use  for  ancillary  data 

1 

1 

1 

1 

1 

Audio 

1 

1 

1 

1 

1 

Data 

1 

2 

2 

1 

2 

Text 

5 

2v 

5 

2v 

5 

Captioning 

1 

1 

1 

1 

1 

Encryption 

1 

1 

1 

1 

3 

Addressing 

1 

1 

1 

1 

3 

Low-cost  receiver 

3 

3 

3 

2 

2 

VCR  capability 

2/3 

3 

3 

2/3 

na(5) 

Extensibility 

to  no  visible  artifacts 

1 

1 

1 

2 

5 

to  studio-quality  data  rate 

1 

1 

1 

2 

5 

to  higher  resolution 

2 

3 

4 

2 

5 

to  VHDTV 

2 

3 

4 

2  1 

5 

to  UHDTV 

2 

3 

4 

2 

5 

Provision  for  future 

2 

2 

2 

2 

5 

compression  enhancement 

ATV  Interoperability  Kev; 

(/  Modified  Later) 

Evaluation  Ratina 

ProDOsed  ImDiementatlon 

1 

•  Easy  to  Implement 

2 

3 

•  Moderately  difficult 

4 

5 

-  Very  difficult  to  implement 

n 

No  answer  or  Incomplete 

V 

Verification  needed 

Footnotes 


C. Tanner 
G.  Demos 
M.  Ltebhofd 
A.  Uytiendaeie 
A.  Toth 
M.  Uebhotd 
A.  Uynendaete 
J.  Deiiislo 
M.  Ltebhold 
G.  Demos 
J.  Bellisio 


J.  Cohen 
J.  Cohen 
J.  Cohen 
G.  Demos 
G.  Hanover 
C.  Tanner 
C.  Tanner 
G.  Hanover 
J.  Hamalainen 

J.  Fuhrer 
J.  Fuhrer 
A.  Toth 
A.  Toth 
A,  Toth 
J.  Bellisio 
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IV.  Summary  of  Interoperability  Evaluations 


Overall 

The  four  all-digital  systems  were  found  to  be  superior  to  the  Narrow-MUSE  system  with 
respect  to  Interoperability  and  Extensibility  on  all  but  two  or  three  characteristics  where  Narrow- 
MUSE  was  approximately  equal  to  the  all-digital  systems.  With  respect  to  Scope  of  Services 
and  Features,  the  all-digital  systems  also  ranked  better  than  Narrow-MUSE  but  the  difference 
was  smaller. 

Interoperability 

AD-HDTV  was  judged  more  Interoperabie  with  headers/descriptors  and  packet  networks. 
DSC-HDTV  was  judged  more  Interoperable  with  computers  with  CCDC  close  behind.  CCDC 
was  judged  more  interoperable  with  film.  The  Review  Board  evaluations  for  interoperabtlity  were 
averaged  with  the  following  resuits  shown  to  greater  precision  than  justified  by  the  subjective 
nature  of  the  assessment  process: 

1 .88  for  AD-HDTV,  1 .90  for  DSC-HDTV,  2.13  for  CCDC.  2.43  for  DigiCipher  and  3.61  for 
Narrow-MUSE. 

AD-HDTV  also  led  the  ranking  by  the  proponents  when  averaged  over  all  interoper^ility 
categories,  but  DSC-HDTV  was  ranked  by  the  individual  experts  best  or  tied  for  best  in  all 
categories  except  headers/descriptors. 

Scope  of  Services  and  Features 

AD-HDTV  was  ranked  by  the  Review  Board  as  best  or  tied  for  best  in  six  out  of  nine 
features.  With  uniform  weighting,  the  average  evaluations  were; 

1 .66  for  AD-HDTV,  1 .71  for  DSC-HDTV,  1 .83  for  CCDC ,  1 .85  for  DigiCipher 
and  2.51  for  Narrow-MUSE. 

It  should  be  noted  that  the  differences  are  not  significant  and  that  uniform  weighting  may  not  be 
appropriate  as  discussed  on  page  5.  AD-HDTV  also  received  the  best  average  evaluation  in  the 
proponents’  evaluation.  DSC-HDTV  was  ranked  best  or  tied  for  best  in  all  features  by  the 
individual  experts. 

Extensibility 

DSC-HDTV  was  ranked  by  the  Review  Board  as  best  or  tied  for  best  in  four  out  of  the  six 
categories  and  was  almost  equal  to  AD-HDTV  in  a  fifth  feature.  With  uniform  weighting,  the 
average  evaluations  were: 

2.43  for  DSC-HDTV,  2.53  for  both  AD-HDTV  and  CCDC.  3.08  for  DigiCipher,  and  4.50 
for  Narrow-MUSE. 

AD-HDTV  was  ranked  by  the  proponents  as  best  or  tied  for  best  in  all  extensibility  categories. 
AD-HDTV  was  also  ranked  best  or  tied  for  best  by  the  individual  experts. 
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Notes  on  Interoperability  Evaluation  Matrix 


INTEROPERABILITY 

CABLE 

All  proposed  systems  can  be  delivered  successfully  over  cable. 

DIGITAL  TECHNOLOGY 

The  four  digital  systems  have  an  extreme  advantage  here. 

HEADER/DESCRIPTORS 

AD-HDTV  has  been  designed  and  tested  with  header/descriptor  capabilities. 
Header/descriptors  could  be  retrofitted  to  the  other  digital  systems  with  some  difficulty. 
Header/descriptors  are  not  meaningful  or  implementable  in  an  analog  system. 

NTSC 

The  2:1  relationship  of  the  1050  line  systems  (AD-HDTV  and  DigiCipher)  to  NTSC  is  an 
advantage.  The  3:2  relationship  of  the  787  line  systems  (CCDC  and  DSC)  is  not  as  desirable,  but 
better  than  the  1 1 25  lines  of  Narrow  Muse. 

FILM 

AD-HDTV.  DigiCipher  and  CCDC  have  a  film  transmission  mode  at  24  fps.  DSC  requires 
conversion  to  59.94  fps,  but  can  benefit  from  some  increase  in  encoding  efficiency.  Narrow  Muse 
will  exhibit  traditional  TV  and  film  interoperability. 

COMPUTERS 

AD-HDTV  has  interoperability  at  the  picture  level  (in  its  progressive  scan  and  square  pixel 
mode)  and  at  the  compressed  bit  stream  level  (MPEG).  CCDC  and  DSC  only  have  progressive 
scan  and  square  pixel  interoperabUity  at  the  picture  level  DigiCipher  suffers  from  interlace  and 
rectangular  pixels  in  all  modes,  but  it  at  least  is  digital.  As  an  analog  system.  Narrow  Muse  will  be 
very  difficult  to  integrate  with  computers. 
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SATELLITE 

All  digital  systems  require  remodulation  (to  QPSK)  for  satellite  transmission.  AD-HDTV, 
DigiCipher  and  CCDC  all  have  constant  data  rate,  while  DSC  suffers  from  a  variable  data  rate, 
making  the  interfacing  more  difficult.  As  an  analog  system,  Narrow  Muse  will  require 
remodulation  to  FM,  a  relatively  simple  task  that  is  similar  in  complexity  to  the  QPSK 
remodulation  required  in  AD-HDTV,  DigiCipher  and  CCDC. 

Packet  networks 

Only  AD-HDTV  has  a  packetized  transmission  format  that  has  been  implemented  and  tested. 
Other  digital  systems  can  be  interfaced  to  packet  networks  with  some  moderate  difficulty.  As  an 
analog  system.  Narrow  Muse  will  be  very  difficult  to  integrate  with  packet  networks. 

INTERACTIVE  SYSTEMS 

AD-HDTV  has  MPEG  compression,  which  will  be  used  in  CD-I  and  other  interactive  systems. 
CCDC,  DigiCipher  and  DSC  can  be  interfaced  to  interactive  systems,  but  may  require  either 
multistandard  decoders  or  decompression/recompression  (with  the  associated  artifacts).  As  an 
analog  system.  Narrow  Muse  will  be  very  difficult  to  integrate  with  computers. 

FORMAT  CONVERSION 

Picture  level  format  conversion  equipment  has  been  in  existence  for  several  decades.  There  is  a 
wealth  of  knowledge  about  performance  and  cost/performance  tradeoffs.  Therefore,  all  systems 
are  equal  at  this  level.  Conversion  among  compressed  forms  is  much  more  difficult,  and  will 
likely  require  decompression/recompression  (with  the  associated  artifacts).  AD-HDTV  has  an 
advantage  over  the  other  digital  systems  because  its  MPEG  compression  syntax  will  be  common  to 
other  MPEG-based  systems,  and  it  will  not  require  decompression/recompression. 

SCALABILITY 

At  the  compressed  bit  stream  level,  AD-HDTV  and  DSC  both  exhibit  some  scalabiliQr,  since 
they  have  “viewable  picture  subsets”  as  part  of  their  two-tier  transmission.  AD-HDTV  carries  the 
scalability  to  the  signal  level,  since  the  HP  bit  stream  is  a  separate  carrier  (DSC  uses  a  single  carrier 
that  is  time  division  multiplexed).  CCDC  and  DigiCipher  are  inherently  single  tier  systems, 
although  they  can  have  some  rudimentary  DC  term  extraction  from  their  DCT. 
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STUDIO  QUALITY 

All  proposed  systems  can  be  extended  to  provide  studio  quality  versions.  AD-HDTV’s  MPEG 
compression  can  accommodate  several  different  approaches  within  the  same  compression  syntax, 
depending  upon  studio  requirements  (e.g.,  with  a  flexible  GOP  structure,  MPEG  can  have  all  I- 
frames  to  meet  editing  requirements).  MPEG  GOP  structure  also  makes  studio  operations  in 
compressed  form  easier.  Narrow  Muse  will  require  encoding/decoding  to  1125/60. 

HIGHER  RESOLUTION 

All  proposed  systems  can  be  extended  to  provide  higher  resolution  versions.  AD-HDTV’s 
MPEG  compression  is  already  a  standard  with  this  proven  capability.  As  an  analog  system. 
Narrow  Muse  is  not  well  suited  to  this  flexibility  to  accommodate  higher  resolution. 

VHDTV  AND  UHDTV 

The  four  digital  systems  can  be  extended  to  provide  VHDTV  and  UHDTV  versions.  AD- 
HDTV’s  MPEG  compression  is  already  a  standard  with  the  proven  capability  to  increase 
resolution.  VHDTV  and  UHDTV  that  is  backward  compatible  with  HDTV  will  require  an 
augmentation  approach.  AD-HDTV’s  packet  structure  provides  a  mechanism  to  achieve  this,  while 
other  digital  systems  will  be  more  difficult  to  augment  without  damaging  service  to  existing 
receivers.  As  an  analog  system.  Narrow  Muse  is  not  well  suited  to  benefit  from  analog 
augmentation  approaches. 

FUTURE  COMPRESSION  ENHANCEMENT 

The  four  digital  systems  can  all  benefit  from  future  encoder  improvements.  However,  MPEG 
compression  was  designed  to  provide  this  capability  as  a  design  consideration.  Narrow  Muse  is 
not  as  well  suited  to  benefit  from  future  encoder  improvements. 
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FAX  TO: 

FROM: 


GuyBeaklcy 
Stellacom,  Inc. 

Fax:  703^43-1698 


Bob  Rast 


SUBJECT :  Interoperability  Evaluation  Matrix 


CC:  Bob  Sanderson 
WooPaik 
Jerry  Heller 
JefTKrauss 
Quincy  Rodgers 


Attached  please  find  the  Interoperability  Evaluation  Matrix,  which  we  have  filled  in,  with  our 
rating  of  each  of  the  digital  proponents. 

We  have  refined  the  rating  scale  you  have  proposed,  and  wish  to  share  our  definition  with  you. 
V^ereas  as  your  rating  scale  was  based  on  ease  of  Implementation  only,  we  believe  that  a  fair 
rating  scale  should  include  other  factors  as  well.  The  set  of  factors  we  used  are; 

•  •  Ease  of  implementation  1.  Ea^ 

3.  Moderately  difficult 
5.  Very  difficult 

•  Performance  1.  Clear  advantages 

3.  OK 

5.  Questionable 

•  How  real  1.  Demonstrated 

2.  In  hardware,  not  demonstrated  or  tested 

3.  Has  proposal,  seems  OK 
S.  Questionable 

•  Cost  effectiveness  1.  Included  and  cost  effective 

3.  Not  included,  probably  OK 
5.  Looks  costly 

The  following  convQrs  key  thoughts  affecting  the  rating  of  each  item; 

Interoperability 

with  digital  technology  -  tonic  seems  redundant  against  later  more  specific  topics,  or 
ambiguous.  AJl  systems  distal. 

^  with  headers/descriptors  -  all  have  proposals,  ADTWs  is  most  real,  but  not  fUUy  verified 

/  with  NTSC  -  DigiCipher  has  a  developed,  compatible  system,  b  tateijaced,  and  hw 

favorable  scan  parameters  for  conversion  to  NTSC.  Through  MUSE,  Narrow  ^SE 
has  demonstrated  NTSC  compatibility.  ADTV  has  the  scanning  relauonship 
advantages  ^so. 

vwth  film  -  DigiCipher  and  CCDC  have  implemented  this  feature  in  prototypes,  and 
i  demonstrated  it .  Interlace  systems  get  more  benefit  from  the  feature. 
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Interoperability  Evaluation  Matrix 
October  6,  1992 
Page  2 

with  computers  -  progressive  systems  have  advantage.  Square  pixels  also  an  advantage, 
but  not  as  much  so.  DigjCipher/computer  interoperability  not  as  well  thought  out. 

with  satellites  •  DigiCipher  has  demonstrated  satellite  transmission,  and  capability  built  into 
CCDC. 

with  cable  «  even  i^  cable  is  being  evaluated  elsewhere,  it  is  important  to  any  legitimate 
interoperability  assessment,  ^ote  that  there  are  overltos  wth  other  parts  of  the 
Advisory  Committee  in  other  areas  also.)  Di^Cipher  capability  has  been  demonstrated 
and  tested,  uses  straightforward  QAM  approach.  CCDC  is  compatible.  VSB  more 
desirable  than  split  signal  QAM. 

with  packet  networks  -  nobody  has  demonstrated,  but  ADTV  most  real. 

with  interactive  systems  -  no  particular  advantages  for  anybody. 

Format  conversion  >  somewhat  easier  to  convert  progressive  images. 

Scalability  -  ADTV  and  DSC  have  described  scalable  subset  at  transwitch  level.  All 
scalable  at  the  picture  processing  level. 


Scope  of  Services  and  Features 

Initial  use  of  ancillary  data  -  no  particular  advantages. 

Audio  -  Dolby-based  ^sterns  compatible  with  Dolby  surround  sound  proposal,  MIT  audio 
compatible  at  interface. 

Data  •  no  particular  advantages. 

Text  •  no  particular  advantages. 

Captioning  >  no  particular  advantages. 

Encryption  •  all  amenable,  no  particular  advantages. 

Addressing  -  DigiCipher  and  CCDC  reserved  capacity  for  this. 

Low-cost  receiver  *  possible  advantage  for  ADTV  and  DSC  with  transwitch  layer  proper 
subset,  ofi&et  by  DigiCipher  fundamentally  lower  cost. 

VCR  capability  -  IMpCipher  has  demonstrated,  and  described  tricks  in  greatest  detail  (to 
SSAVP-3).  CCDC  is  compatible  with  Di^Cipher  in  this.  DSC  tricks  feasibiUty 
deserves  more  detail. 

Extensibility 

to  no  visible  artifacts  •  DigiCipher  has  shown  Emulations  at  30  Mbps,  CCDC  is  related 
tedmolgy. 

to  studio-quality  data  rate  -  none  have  demonstrated,  nor  done  much  investigation. 

to  higher  resolution  •  some  advantage  to  interlace  for  defined  migration  to  1050 
^  progressive.  MUSE  is  Mgher  resolution,  and  Narrow  MUSE  is  derived  from  it. 

j  to  VHDTV  -  no  advantages,  some  questions. 

i  to  UHDTV  -  no  advantages,  some  questions. 

Provision  for  future  compression  enhancement  -  no  clear  advantages. 
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H0TE8  FOR  INTEROPERATILITY  EVALUATION  MATRIX  | 

(a)  The  four  digital  systems  all  share  the  virtues  of  digital  signal 
representation,  so  that  the  images,  parts  of  images,  and  data 
associated  with  images  can  be  stored,  rearranged  and  reformatted 
in  arbitrary  ways  depending  on  the  needs  of  the  application. 
Such  flexibility  is  not  available  for  an  analog  system. 

(b)  ADTV  and  DSC  have  a  larger  amount  of  channel  capacity  available 
for  use  in  headers  and  descriptors.  DSC  has  a  larger^  unassigned 
channel  capacity  for  ancillary  data  than  the  other  digital  sys¬ 
tems,  while  ADTV  appears  to  have  the  least  such  unassigned  ancil¬ 
lary  data  channel  capacity. 

(c)  The  conversions  from  interlaced  scans  to  interlaced  scans  are 
more  complicated,  and  are  likely  to  incorporate  conversion  arti¬ 
facts.  The  Narrow-MDSE  scanning  numbers  arc  not  congenial  to 
NTSC  conversion. 


(d) 


(e) 


(f) 

(g) 


(h) 


Film  to  interlace  requires  extra  1/2-frame  memory  for  reinterlace 
at  receiver,  for  adtv  and  DigiCipher.  Narrow-MUSE  has  no  mecha¬ 
nism  for  taking  advantage  of  film's  lesser  frame  rate  to  deliver 
better  pictures. 


adtv  and  DigiCipher  will  introduce  interlace  artifacts.  DigiCi¬ 
pher  and  CCDC  do  not  have  a  clean  packet/slice  structure  from 
which  to  extract  data  for  windowing.  Narrow-MUSE  uses  analog 
signals. 

Digital  systems  will  avoid  the  addition  of  noise  associated  with 
analog  modulation  on  satellite. 

ADTV  and  DSC  have  fixed-size  packets  with  priority  assignments. 
DSC  has  a  variable  information  content  rate  which  is  easily 
carried  by  packet  networks.  Therefore,  a  broadcaster 
could  adjust  the  2-level/4-level  mix  for  applications  that  take 
advantage  of  the  variable  bit-rates  supportable  by  packet  trans¬ 
mission.  DigiCipher  and  CCDC  data  objects  have  variable  size,  no 
slice  boundaries,  and  have  no  priority  structures. 


adtv  and  DigiCipher  are  interla&ed,  making  interactive  operation 
more  complicated.  ADTV  has  long  round-trip  delay.  DigiCipher 
and  CCDC  incorporate  4  moving  panels  with  progressive  refreshing, 
vhich  makes  interactive  applications  less  graceful. 
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(i)  ADTV  is  interlaced,  so  that  format  conversions  are  more  compli¬ 
cated  and  introduce  artifacts,  but  has  service  type  indicators. 
Digicipher  is  interlaced.  CCDC  and  DSC  are  progressive-scanned, 
facilitating  conversions  spatially  and  temporally.  Digicipher 
notion  estimates  are  less  precise  (not  sub-pixel,  not  local), 
making  frame  interpolation  more  difficult. 

(j)  None  is  scalable  in  the  strictest  sense  of  subsettable  data 
streams  over  a  vide  range  of  performance  levels.  ADTV  and  DSC 
have  priority  data  providing  limited  scalability.  Interlace  in 
ADTV  and  Digicipher  makes  scalability  difficult  since  DCT  in  an 
interlaced  field  will  exhibit  different  artifacts  from  field  to 
field  when  only  DC  coefficients  are  retained. 

(k)  The  digital  systems  all  can  flexibly  support  these  features  by 
appropriate  formatting  and  use  of  headers  and  descriptors.  The 
Narrow— MUSE  system's  audio  is  digital,  but  has  limited  other 
digital  channel  capacity  for  these  features,  and  analog  encryp¬ 
tion  is  complicated.  CCDC  and  Digicipher  have  only  limited 
capacity  and  lack  supporting  data  structures  for  new  features. 

(l)  ADTV  requires  more  memory  in  receivers,  and^  MPEG  decoding  with 
forward  and  reverse  prediction  is  more  complicated.  ^  Narrow— MDSE 
uses  a  variation  of  the  already— developed  MUSE  algorithm  that  ^  is 
still  quite  complex,  involving  analog  functions  (performed  with 
digital  circuits) .  The  other  systems  have  similar  processing 
complexity  to  each  other. 

(m)  Narrow-MUSE  claims  digitized  version  of  signal  can  be  recorded 
using  current  VCR  technology,  but  high  bit  rate  will  be  very 
expensive.  ADTV  can  use  l-frames  for  clean  trick  modes,  although 
only  particular  speed  (multiples)  are  supported,  and  extraction 
of  data  may  be  costly.  CCDC  and  Digicipher  can  use  PCM  data  to 
support  fast  search,  but  speed  range  (intraframe  and  quality)  are 
limited.  DSC  can  search  at  any  speed  but  with  reduced  quality. 

(n)  Because  ADTV  and  Digicipher  are  interlaced, 'they  may  require  oo^e 
bits  for  coding  using  intraframe  only.  DSC  and  cCDC  encode 
pj^ogressive— scanned  images,  are  therefore  more  efficient  in 
moving  images.  DSC  uses  vector  quantization  for  coefficient 
pattern  selection,  allowing  superior  matching  of  coding  algoritiOT 
to  perception  in  human  visual  system.  Narrow— MUSE  extensibility 
limited  by  practical  analog  processing  in  affordable  bandwidths. 
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(o)  The  progressive-scanned  CCDC  and  DSC  have 

artifact-free  extensions  to  higher  resolution  (both  spatially  land 
temporally)  than  the  interlaced  ADTV  and  DigiCipher  systems . 
Narrow-MUSE  is  not  flexibly  extensive  because  of  the  retired 
bandwidth  and  analog  processing.  The 

which  move  from  field  to  field  with  a  total  cycle  time  of  20 
fields,  and  the  associated  constraints  on  motion  vectors  compli¬ 
cate  extensibility  of  CCDC  and  DigiCipher. 

rn)  All  the  digital  systems  can  use  a  layering  approach  to  incorpo- 
rate  new  developments  in  compression,  and  all  can  exploit  im¬ 
provements  in  motion  estimation.  The  4-panel  processing  in  CCDC 
Ind  DigiCipher  will  make  it  difficult  to  extend  in  a  compatible 
way  to  higher  spatial  and  temporal  resolution.  Analog  processing 
limits  prospects  for  Narrow-MUSE. 
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FCC  WP4 

Review  Board  Member  Interoperability  Evaluation 
Gary  Demos 
6  October  1992 
PYecutive  Summary 

The  following  discussion  is  my  evaluation  of  the  advanced  television  (HDTV)  proposals  before  the 
FCC  as  part  of  the  WP4  Interoperability  review  process. 

To  summarize  my  comments: 

*  I  asked  a  number  of  questions  of  the  proponents,  many  of  these  questions  were 
ignored  or  answered  negatively 

*  The  FCC  teeing  process,  which  is  nearing  completion,  was  flawed  in  that  many  crucial 
issues  of  interoperability  were  not  tested.  This  is  partly  due  to  the  fact  that  the  testing  process 
was  begun  when  all  the  systems  were  analog.  The  t^ing  process  should  be  redesigned  in  light 
of  digital  technology  and  interoperability  issues. 

The  most  critical  interoperability  problems  are: 

'Two  of  the  digital  systems  and  the  one  analog  system  are  interfaced,  with  non  square 
pixels.  These  systems  should  be  rejected. 

*  All  of  the  systems  are  at  59.94  Hz  or  60.0  Hr  This  is  not  compatible  with  computer 
display  interoperability,  requiring  greater  than  70  Hz.  No  system  should  be  accepted  at  59.94  or 
60.0  Hz. 

’  The  apparent  ability  to  send  movies  (24  fps)  at  higher  resolution  has  not  been  explored 
'  The  header  proposals  are  not  universal,  but  are  buried  within  the  HDTV  packet  formats 
'  The  packet  structures  are  untested  with  ATM  or  other  packet  networks 
'  System  modularity,  which  would  allow  the  best  of  each  system,  has  not  been  evaluated. 

*  None  of  the  systems  is  scalable  downward.  A  lower  resolution  subset  would  allow  low 
cost  reception  at  resolution  below  full  quality,  but  potentially  better  quality  than  NTSC. 

*  Still  imagery  communication  has  not  been  tested,  although  apparently  feasible. 

'  It  could  be  signrficant  if  a  partial  screen  update  capability  were  provided.  Some  of  the 
systems  may  have  this  capability,  but  it  has  not  been  explored  or  tested.  This  would  allow 
reduced  bit  rate  presentation  of  high  resolution  images  during  times  when  other  data  is  being 
sent  Oike  address  authorizatton  lists),  or  for  use  by  lower  data  rate  devices. 

•Compression  quality  is  rapidly  advancing.  We  need  only  examine  the  recent  two  years 
to  get  a  feel  for  the  rapid  pace  of  development.  No  proponents  offered  proposals  for  handling 
major  advancements  in  compression  technology.  In  ten  years,  it  is  likely  that  all  of  the  proposed 
systems  will  be  obsolete  in  their  approach.  How  will  we  handle  the  e>rfensibility  issue  in  order  to 
allow  graceful  adoption  of  future  advancements  in  image  compression  and  representation? 
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*  The  F(X  process  has  encouraged  innovation  up  to  this  point  by  stimulating  digital 
systems  to  be  proposed  by  the  proponents.  However,  at  this  point  what  is  required  is 
cooperation  and  system  modification  in  an  orderly  manner,  with  subsequent  testing  of  any  such 
modificatbns.  The  FCC  process  as  now  constituted  IS  DESIGNED  TQ  PRECLUDE  such 
cooperation,  system  modification,  and  subsequent  testing!  Thus,  the  work  of  this  interoperability 
review  board  is  likely  to  have  little  significance  unless  the  FCC  process  is  amended  to  take 
interoperability  seriously. 

Summary'  Of  My  Evaluation  Of  The  Proponents. 

My  evaluation  is  presented  in  much  more  detail  below.  However,  a  summary  of  my  view  of  each 


Weaknesses 

59  .  S4  Hz 
No  24  Hz  mode 
Insufficient  ATM  Correction 
Odd  Format  Size  (523/262) 
Regional  leak  not  fully  tested 
Buried  Header  Not  Universal 
Partial  Image  Update  Not  Tested 
Dependent  Audio  Channels 
Not  Scalable 

59.94  Hz 

No  ATM  Mapping 

Data  Not  Prioritized 

Odd  Format  Size  (525  Datalines) 

No  Header  Proposed 

Partial  Image  ^xJate  Not  Tested 

Sliding  Panels 

Not  Scalable 


system  is  presented  here: 


Accept  If  Modified; 
AT&T/ZenIth 


M  IT/AT  V  A 


Strengths 

Progressive  Scan 
Square  Pixels 
Ability  To  Correct  ATM 
Prioritized  Data 

8x8  small  motion  vector  blocks 
Sub-Pixel  motion  resolution 
Vector  Quantization 
Easy  NTSC  DownConversion 
Digital 

Progressive  Scan 
Square  Pixels 
24  Hz  Film  Mode 
Independent  Audio  Channels 
Sub  Pixel  motion  resolution 
Both  8  x  8  and  16  X 16  motion 
Easy  NTSC  DownConversion 
Digital 


Reject  As  Not 
ATRC 


Interoperable: 

Packet  Structure 
Prioritized  Data 
Digital 


G.l. 


Digital 


Interlaced  Scan 
Non-Square  Pixels 
Provision  For  Square  Confused 
59.94  Hz 

Header  Under  ATRC  Packets 
Dependent  Audio 
16x16  coarse  motion  only 
Not  Scalable 

Interlaced  Scan 
Non-Square  Pixels 
59.94  Hz 
No  Header 
Dependent  Audio 
1 6  X  32  coarse  motion  only 
Sliding  Panels 
Not  Scalable 


NHK 


Analog 
Interlaced 
59.94  Hz 
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No  Header 
No  Data  Formats 
Not  Scalable 


Unanswered  Questions 

•  I  submitted  a  list  of  questions  which  I  was  concerned  with  to  the  proponents  prior  to  the  review 
meeting.  A  copy  of  my  questions  are  attached  to  this  evaluation.  My  questions  were  answered  in 
writing  and  by  presentation  by  MIT.  None  of  the  other  proponents  answered  the  questions 
directly,  although  some  of  the  questions  were  answered  in  the  course  of  presentations. 

•  In  general,  the  answers  which  I  was  able  to  get  fell  into  the  following  categories: 

1)  Seems  reasonable,  have  potential  solution,  but  haven't  tried  it 

a)  Could  be  tested  without  undue  difficulty 

b)  Would  be  difficult  to  test  with  existing  prototype  equipment 

2)  Good  issue,  haven't  considered  it. 

3)  Not  able  to  support  the  item  concerned. 

•  Some  of  the  proposals  for  solutions  seem  too  preliminary  to  be  plausible 
I  will  cover  these  issues  in  more  detail  here 

Testing 

•  There  have  been  substantial  flaws  in  the  testing  process  from  an  interoperability  point  of  view. 
These  are; 

•  The  ATTC  only  provided  59.94  Hz  signals  to  the  proponents.  No  other  rates  were 
explored. 


•  24  Hz  signals  were  not  supplied  even  though  some  of  the  systems  can  accept  and 
transmit  at  24  Hz 

•  Increased  resolution  at  24  Hz  vs  higher  rates  has  been  proposed,  but  has  not  been 

tested. 

•  70+  Hz  computer  display  compatibility  was  not  tested  in  any  way 

•  Still  frames  were  not  tested.  Quality,  how  many  per  second,  etc.  should  be  tested. 

•  No  flexibility  in  performance  was  tested.  Such  ftexibifity  would  include: 

•exploring  the  limits  of  pi>«ls  per  second  (Jae  Urn's  comments  were  that  he  could 
go  as  high  as  100  MPixels/second). 

•  exploring  conditional  replenishment 

•  exploring  alternate  frame  nates 

•  Potential  modularity  of  systems  was  not  tested.  Systems  were  tested  only  as  a  whole. 

•  Header  mechanism  has  not  been  tested  for  reliability  (en-or  performance),  universality, 
usefulness. 
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’  ATM  mapping  and  error  performance  was  not  tested. 

*  Computer  text  and  graphics  on  screen  was  not  tested  (PC  or  computer  workstation 
display).  Overlay  plane  implementations  (if  used)  were  not  implemented  or  tested. 

'  Scalability  not  proposed  and  not  tested. 


•  Extensibility  not  tested.  Only  proposed  verbally. 

'  NTSC  down  conversion  not  tested  for  quality. 

•  "Production"  or  "Contribution"  quality  extensible  superset  not  tested,  although 
proposed  verbally. 

•  jpe  "high  priority  subsets"  used  in  ATRC  and  AT&T/Zenith  were  not  tested  for 
viewability  by  themselves,  although  private  demonstrations  of  viewability  have  been  promised. 

•  The  distribution  of  an  address  list  for  pay-per-view  cable  use  would  require  a  lov^r  bit 

rate  to  be  used  for  still  image  or  partial  screen  update  during  the  di^ntxrtion.  The  ^  ® 

million  subscriber  address  list  should  be  tested,  and  the  reliability  of  delivery  should  measured 
in  terms  of  the  number  of  subscriber  errors.  The  quality  of  the  image  during  the  distribution 
should  be  tested. 


•  Encryption  may  affect  the  error  performance.  The  impact  of  the  use  of  encryption  on 
'  the  quality  of  the  picture  should  be  tested.  The  protection  afforded  by  propped  enc^^ion 

schemes  should  be  evaluated.  The  interaction  of  headers  and  encryption  should  be  tested, 
including  "in  the  clear  headers  in  an  otherwise  encrypted  data  stream  for  such  uses  as 
authorization  of  decryption  to  new  users  being  add^  on. 

•  A  key  use  of  advanced  television  may  be  professional  consultation  and  collaborative 
work.  A  multi  person  teleconference  should  be  tested.  An  example  might  be  a  medical 
consultation. 


•  Some  of  the  system  proponents  indicated  compatibility  with  digital  oompr^ed  NTSC 

proposals  to  Cable  Labs.  Such  compatibility  should  be  tested  for  receiver  comf^tibility, 
distribution  compatibility,  and  interoperability  compatibility.  Some  claimed  that  t*!©  ^ 

NTSC  standard  would  only  be  selected  after  the  advanced  television  standard.  This  ^ould  be 
verified  if  true,  but  testing  for  interoperability  and  compatibility  will  still  need  to  occur  poor  to 
selection  of  any  given  compression  algorithm. 

•  The  use  of  a  VCR.  laser  disk,  or  other  devices  implies  a  digital  port,  wire,  and  signal.  The 
specifications  reliability,  and  efficiency  of  such  connections  should  be  evaluated  pnor  to 
sS^on  of  any  such  signal  designations.  If  industry  standard  digital  formats  are  used  (such  as 

PI 394),  then  these  signals  should  be  tested  on  these  formats. 


•  There  was  substantial  discussion  of  higher  quality  signal  feeds.  Channels  which  might 
have  wider  bandwidth  than  20  Mbits  per  second  include  ATM  networks,  long  haul  fiber  (eg. 
SONET)  cable  TV.  satellite  transponder  channels,  cellular  digital.  OFDM,  or  otter  networks.  Tte 
proposate  should  be  tested  with  these  other  data  rates  for  compatibility  ^d  quality.  The  mapping 
of  reduced  data  rate  e>4ractions  of  20  Mbits/sec  from  higher  ^a  rates  should  te  tested.  Tte 
methods  for  simultaneously  generating  20  Mbits/sec  h  addition  to  higher  rates  from  a  source 
signal  should  be  tested. 


•  Some  people,  such  as  Jim  Oark.  Chairman  of  Sflicon  Graphics,  feel  that  30  graphics  will 
be  routinely  affordable  for  home  receivers  and  computers.  No  testing  of  3-D  graphics 
Interoperability  was  investigated. 
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Attached  you  wUl  find  my  ATV  Interoperability  Evaluation  Matrix.  In  addition  to  the  Matrix  you  will  find 

below,  some  of  my  thoughts  and  my  guidelines  used  for  completing  the  Matrix. 

1.  First  of  all,  the  present  deadline  for  the  US  HDTV  standard  is  close.  Therefore,  the  proponents  cannot 
make  any  major  hardware  nor  software  changes. 

2.  Changes  which  effect  the  tested  video  and  audio  performances  cannot  be  accepted  without  having  the 
proposals  at  least  partly  retested. 

3.  The  cost  ratio  between  the  decoder  and  the  rest  of  the  receiver  (display,  power  supply,  cabinet,  audio 
amplifiers,  loudspeakers  -  RF  tuner?  -  etc.)  will  be  about  30%  -  70%,  perhaps  even  more  in  the  later 
phase.  Therefore,  we  can  accept  added  complexity  in  the  system  if  this  improves  the  scalability  and  the 
extensibility  of  the  system.  In  the  above  example,  increasing  the  cost  of  the  decoder  by  50%  would 
increase  the  cost  of  the  receiver  15%. 

4.  The  complexity  of  the  system  cannot  however  go  so  far,  that  the  cost  of  the  encoder  gets  out  of  hand  or 
makes  a  portable  encoder  impractical  in  the  future. 

5.  The  selected  systems  sliould  not  favor  NTSC  too  much.  Eventually,  NTSC  will  be  disbanded.  Tlic 
artifacts  converting  from  HDTV  to  NTSC  will  be  masked  by  the  hTTSC  systems  own  artifacts.  The  basic 
frame  rate  of  60Hz  (or  even  72  Hz  with  a  hierarchy  of  12, 24, 48, 72)  could  enhance  US  posidon  as  the 
main  provider  of  TV  programs  (movies  and  live  TV  as  well)  in  the  world. 

6.  Square  pixels,  this  issue  which  has  caused  many  heated  discussions,  should  be  discussed  once  more  in 
light  of  interoperability.  My  dilemma  is  as  follows: 

•  The  present  systems  have  been  tested  using  analog  signals  as  an  input  to  the  comprc.s.sinn 
encoder,  each  proponent  then  digitizes  this  video  signal  providing  the  scanned  image. 
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•  In  the  near  future  all  studio  processing  will  be  digital,  and  the  video  as  well  as  the  other  data 
coming  to  the  encoder  will  be  digital. 

•  The  production  standard  for  the  studios,  should  it  be  the  same  as  the  tnin.smi.ssion  standard 
(active  pixels  per  frame)?  IVaditionally  in  the  analog  world  the  production  quality  was  higher 
titan  the  transmission  quality.  To  maintain  square  pixels  for  both  transnmsion  and  production 
can  only  be  done  if  the  numbers  stay  die  same,  if  the  production  quality  has  to  be  better,  thin 
both  the  line  and  pixel  rates  have  to  be  modified  to  maintain  the  square  pixels. 

7.  Packeiizing  the  data  is  essential  for  good  network  operations  as  well  as  for  VCR’s  and  Computers. 

8.  Rcganiing  (receiver)  scalability,  I  would  like  to  classify  the  teccivers,  and  video  quality  as  follows: 

•  <10  Lich,  portable  receivers 

•  20  -  30  inch  regular  viewing 

•  >30  inch  tliPi  receivers 

I  don’t  think  that  DC  component  of  DCH"  is  adequate  for  any  viewing. 

9.  Can  the  encoder  provide  outputs  for: 

a)  200*400  Mbs  “lossless  compression,”  without  temporal  compression  enabling  simple  effects 

such  as  wipes  and  insertions. 

b)  40-45  Mbs  distribution  level,  allowing  cuts  at  edit  points. 

10.  Almost  everyone  agrees,  that  eventually  the  systems  will  be  progressively  scanned.  Either  diis  has  to  be 
put  into  the  systems.from  the  beginning  or  the  proponents  have  to  show  how  die  older  receivers  can 
handle  this  type  of  data.  Interlace  will  provide  artifacts  on  the  computer  displays. 

11. 1  hope  that  the  promises  and  proposals  given  for  the  added  interoperability  do  not  change  too  much  the 
numbers  established  by  WP-3  for  the  cost  analysi.'?. 

12.  A  few  words  about  the  systems. 

•  Because  the  Narrow  Muse  is  an  analog  system,  it  cannot  be  rated  for  digital  interfaces. 

•  Digicipher  and  CCDC  four  have  vertical  panels,  this  provides  nice  commonality  for  compo¬ 
nents  and  works  well  with  their  plans  for  NTSC  transtpissions,  reduces  cost,  but  what  about 
the  future  expandability?  • 

Finally.  I  personally  favor  a  computer  compadble  approach,  but  within  reason,  TV  for  most  consumers  will 
be  a  one  way  media  and  not  a  LAN  type  network,  although  it  is  getting  much  more  sophisticated,  pay  per 
view  type  operations  and  certain  other  low  level  interactive  funedoms  will  become  more  common.  Tlic  ratio 
between  tegular  viewing  and  computer  type  operations  will  remain  high  (perhaps  l(X):l  or  more),  therefore 
we  cannot  penalize  the  majority  of  viewers  by  increasing  the  cost  too  much  or  by  making  the  viewing  too 
complicated. 
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JKH/Jay 

c^:  Bob  Sanderson 

October  5, 1992 


Sincerely, 

AoAy 

✓^Jukka  Hamalainen 
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a)  Spectrum  efficiency  of  AD-HDTV  suffers  on  cable  because  of  co-channel  notch.  > 

b)  Considerations  were:  easy  reassignment  of.  bits,  easy  addition  of  other  services  zJd 

general  format  of  information.  / 

I 

c)  All  need  conversion.  NTSC  information  is  not  immediately  accessible  from 
compressed  data  stream.  Conversion  to  NTSC  is  most  important  to  consumer. 

d)  AD-HDTV  requires  scan  change  in  receiver.  DSC film  mod^,  but  can  recognize 
film  in  its  basic  algorithm. 

e)  Considerations  were:  square  pixels  and  progressive  scan. 

f)  100  ms  is  needed  for  channel  change  (and  preferably  other)  interactive  functions. 

g)  MPEG  and  240M  were  major  considerations. 

h)  Access  to  program  information  to  make  simple  receiver  so  that  receiver  need  not 
parse. 

i)  Considerations 

1)  System  functions  (i.e.  cost)  should  be  in  other  parts  of  the  system  as  much  as 
possible  so  that  all  receivers  can  be  low  cost. 

2)  Scalability  and  especially  extensibility  and  interoperability  must  not  eliminate 
the  possibility  of  plain  vanilla  TV  sets. 

3)  Concept  of  zero  cost  default  -  After  high  definition  requirements  have  been 
satisfied  in  the  system,  other  useful  information  is  formatted  in  such  a  way 
that  there  is  not  additional  cost  to  products  which  do  not  use  that 
information. 

4)  Same  as  note  h 

5)  Most  cost  of  receivers  are  not  in  signal  processing  and  the  cost  of  this  section 
is  the  easiest  to  cost  reduce.  The  proposed  system  must  have  little  impact  on 
other  parts  of  the  receiver  than  the  DSP  section(s). 
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ACATS  PS-WP4  Interoperability  Review  Notes  * 

October  6, 1992 

Interoperability  witli  'Computers' 

^Digital  technology'  and  interactive  systems'  as  used  in  the  matrix  are  ambiguious. 
No  scores  are  given.  Answers  given  for  'computer'  interoperability  '  should 
apply 

Definition  of  'Computer' 

The  base  assumption  herein  is  that  functional  differences  are  rapidly  diminishing 
between  computers  ,  television  and  telephones.  The  term  'computer  '  used  here 
refers  arbitrarily  to  intelligent  devices  for  display,  creation,  and  communication  of 
mixed  data  and  media.  Computers  are  handheld,  on  desktops,  TVtops,  and 
eventually,  displayed  on  flat  wall  saeens. 

Basis  for  Ratings 

ATV  systems*  interoperability  with  computers  is  determined  here  by  averaging 
separate  factors  determining  complexity  and  economy  of  transcoding  between 
applications,  media  and  channels: 

•  Syntax  for  headers/descriptors  and  otfier  data  structures 

•  Use  of  square  sampling  grids  (square  pixels) 

•  Progressive  or  Interlace  image  display 

•  Spatial  Resolution 

•  Display  refresh  rate 

•  Scalability 

•  Interoperability  with  packet  networks 


Worksheets  for  each  of  the  proponents  are  attached 
Advanced  Digital  HDTV 
Digital  Spectrum-Compatible  HDTV 
Chaimel-Compatible  Digicypher  HDTV 
Digicypher  HDTV 
Narrow-MUSE  HDTV 
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Advanced  Digital  HDTVAD-HDTV 
Score 

2.42V  •  Average 

•  Syntax  for  headers/descriptors  and  other  data  structures 

2  MPEG  syntax  is  well  defined,  but  not  universal 

•  Use  of  square  sampling  grids  (square  pixels) 

2V  non-square  (square  mode  needs  verificatiorO 

•  Progressive  or  interlace  image  display 

2  interlace  (supports  progressive  at  lov/er  data  rate) 

0  Spatial  Resolution 

3  1440x960  not  easily  transcoded 
0  Display  refresh  rate 

3  59.94  not  easily  transcoded 

0  Scalability 

3V  value  of  HP  stream  to  be  determined 

•  Interoperability  with  packet  networks 

2V  needs  verification 
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Channel-Compatible  Digicypher  HDTV  CCDC-KDTV 


Score 
2  ,42NV 

3N 


1 


1 


3 


3 


«  Average 

o  Syntax  for  headers/descriptors  and  other  data  structures 
syntax  is  not  defined,  may  be  unproved  (?) 

•.  Use  of  square  sampling  grids  (square  pixels) 
square 

«  Progressive  or  interlace  image  display 
progressive 

*  Spatial  Resolution 

1280x720  not  easily  transcoded 

•  Display  refresh  rate 

59.94  not  easily  transcoded 


•  Scalability 

3V  value  of  DCT  to  be  determined 

•  Interoperability  Vifith  packet  networks 


3N 


needs  additional  data 
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•  Average 

o  Syntax  for  headers/descriptors  and  other  data  structures 
Syntax  is  not  well  defined,  but  may  be  improved  (?) 

•  Use  of  square  sampling  grids  (square  pixels) 

non-square 

•  Progressive  or  interlace  image  display 

interlace  (verification  required  of  alternate  display) 


3 


3 


4V 


•  Spatial  Resolution 

1440x960  not  easily  transcoded 

•  Display  refresh  rate 

59.94  not  easily  transcoded 

.  (verification  required  of  alternate  frame-rate) 

•  Scalability 

to  be  determined 

•  Interoperability  with  packet  networks 


3N 


Additional  data  needed 
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Digital  Spectrum-Compatible  HDTV  DSC-HDTV 
Score 

2.28NV  o  Average 

«  Syntax  for  headers/descriptors  and  other  data  structures 
3N  syntax  is  not  well  defined,  may  be  improved  (?) 

0  Use  of  square  sampling  grids  (square  pixels) 

1  square 

«  Progressive  or  interlace  image  display 
1  progressive 

®  Spatial  Resolution 

3  1280x720  not  easily  transcoded 

«  Display  refresh  rate 

3  59.94  not  easily  transcoded 

e  Scalability 

3V  value  of  scalable  DCT  to  be  determmined 

•  Interoperability  with  packet  networks 

2N  additional  data  needed 
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Narrow-MUSE  HDTV  Narrow-Muse 

Score 

3.7V  •  Average 

•  Syntax  for  hcaders/descriptors  and  other  data  structures 

5  Syntax  is  not  defined 

•  Use  of  square  sampling  grids  (square  pixels) 

3  non-square 

•  Progressive  or  interlace  image  display 

3  interlace 

•  Spatial  Resolution 

3  1920x1035  not  easily  traru:coded 

•  Display  refresh  rate 

3  59.94  not  easily  transcoded 

•  Scalability 

4V  value  of  sub-sampled  four-field  to  be  determined 

•  Interoperability  with  packet  networks 
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iv»rv  twterqPBPARTLITY  FVALUATIQN-MATRIX 
Introduction: 

Of  the  ten  performance  evaluation  criteria,  which  the  rcc  will  be  ( 

reach  ItSdnieloTon  which  of  the  five  propoaed  hTV  eyetem 

the  terreotrial  broadcaat  induetry,  5y'®®-5*][JflnIrSliJty**2xtehJiblilty  aS 
potential  ueere  of  the  electronic  medial  JnteroperahiJity,  £xcen»«J-ticy  enw 

Scope  of  Services. 

It  is  being  esBumed  that  digital  advanced  telovlalon  is 

today's  analog  television  and  that  It  could  ^£er  the  ^ 

useful  and  CMt-effective  Interchange  of  electronlo 

aesoolated  data  among  different  signal  fexnate#  ^ 

tuditi,  among  different  applications,  a^ng  different 
different  performanoe  levels,  and  would  therefore  meet  most  of  the 
r0quir«Mnti  of  Int^rop^tMbility • 

It  la  also  assumed  that  the  digital  representation  of  ‘‘i^h-resolution  ^ges 

would  have  the  properties  that  permit  future  ?5 

format  while  retaining  some  measure  of  interoperability  and  therefore 

oooolv  with  the  oonospts  of  extensibility.  The  digital  representation  of  the 

iSgiJ  JhSld  prSfSJably  be  of  a  hlerarehloal  "‘ture  of  spatial  resolution, 

iS^ral  rates  and  image  aspect  ratios  so  that  a 

In^  data  stream  can  be  selected  for  transmiseion,  storage 

noodod  for  a  epacifio  appllcatiOAe  ThiB  than  could  maat  the  objactivaa  of 

«aaiahilit/e 

The  third  requirement,  scope  of  services  Is  related  to  the  channel  oapaolty 
for  ancillary  data  supported  by  the  ATV  system. 

Preaentatiens  bv  prcponenta: 

All  presentations  followed  the  ATV  interoperability 
attwpted  to  explain  how  well  the  ATV  systms  met  or  could 
requirements  listed  in  this  matrix.  Kot  addressed  were  the 
of  reformatting  the  data  and  the  coat  Implloatlons  and 

eompromisss  which  the  missing  fsatures/requlrements  would  Impose  If  they  were 
iffiplomontode 

It  is  noosssary  to  prioritise  this  matrix  as  most  ATV  receivers 

for  broadcaet/cable  reosptlon.  For  these  applications  reoelvers  should 

burdened  with  extra  circuitry  that  may  never  ^  used.  However#  the 

stream  could  be  structured  to  allow  tna  most  important  features  of  tte  matrix 

to  be  Implemented  if  It  does  not  comproolss  the 

sffectl^ness  of  the  primary  applloatlont  ATV  terrestrial  broadcast  reception. 
The  cost  to  Incorporate  these  additional  features#  not  relevant  to  terrestrial 
broadcaBtin^r  ahould  ba  claarly  ctatade 

It  appears  that  the  main  goal  of  all  digital  system  Pfopo»®«t* 
in  e*^ry  limited  time#  e  prototype  ATV  system  that  would  be 
dioital  terrestrial  transmission  in  a  6  NHs  lUT  channel.  To  make  this  Mssible# 
very  high  video  compression  ratios  are  used  and  this  My  aMke  leso^efflelent 
so^able  compression  algorithms  less  likely  to  be  ^plemented,  from  the 
dlsouesionsT^it  appeared  that  true  scalability  -  different  bit  rates  and 
different  quality  levels  -  is  not  present  in  any  of  the  systems. 

^  xithoueh  not  part  of  this  evaluation,  broadcasting  requires  a  set  of  related 
i  aomormMmLon  levels  for  etudlo  applications  I  recording,  film  processing,  post" 
MtSlltrdUtJlbutSn  to  local  dfflUatSd  -tations  at  a  bit  rat. 
hiqh  enough  to  allow  further  processing.  Suggested  data  rates 
applications  oould  be  between  200  Kbs  and  45  Mbs. 

i^*0t  the  ohoice  of  compression  algorithm  for  terrestrial  transmission. 

1  \ 
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At  auch  tim«  th*t  a  oloarly  defined  list  of  required  features  is  agreed  upon, 
proponents  should  be  given  additional  ti«o  to  verify  if  and  at  what  cost  these 
features  can  or  should  be  implemented.  It  should  be  noted  that  some  of  the 
features  listed  in  the  evaluation  matrix  were  not  even  known  and  others  not 
defined  when  the  XTV  systems  were  well  advanced  in  their  development  to  meet  / 
the  ACXTS  Lab  test  deadlines.  Proponents  may  also  decide  not  to  implement 
certain  features  of  the  matrix.  This  too  should  be  made  known.  > 

Since  the  ATV  receiver  will  be  multi''purpose|  ATV,  ITTSC,  VCR,  cable  friendly, 
it  is  Important  that  for  ease  and  coot  effective  conversion  between  ATV  and 
NTSC  in  both  directions  the  scanning  parameters  should  have  a  simple 
relationship  to  each  other  so  that  television  signals  of  different  format  and 
sources  oan  be  processed  for  use  by  a  common  display.  Also,  the  signal  format 
should  be  such  as  to  make  coat  affective  VCRs  possible  and  these  VCRs  should 
provide  the  same  features  as  present  day  VCRs. 

Progressiva  scanning  and  square  pixels  offer  some  powerful  advantages  over 
interlace  scanning  for  all  forme  of  signal  processing  but  the  trade-off  may  be 
extensibility.  Future  upgrade  from  interlace  to  progressive  scanning  using  the 
DigiCipher  or  AOTV  parameters  may  be  easier  to  accomplish  than  upgrading  to  an 
as  yet  unknown  scanning  format  using  the  Zenith/AT(T  or  KIT  parameters. 


Capital  Cities/ABC,  Inc. 
October  5,  1992 
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APPENDIX  III 


REVISED  REVIEW  BOARD  RESPONSES 


PANASONIC  TECHNOLOGIES  Inc. 

Matsushita  Applied  Research  Laboratory 

95D  Connecticut  Drive,  Burlington,  NJ  08016  USA 

Telephone  609-386  5995  Fax  609-386  4999 

TO; 

MR.  GUYBEAKLEY 

FROM; 

JUKKA  HAMALAINEN 

SUBJ; 

MARL 

PAGES: 

Dear  Guy, 

1 

Attached  you  will  find  my  updated  chart.  Not  too  many  changes! 

tlie  mom  we  liave  studied  the  sy.stems,  the  closer  they  get  -  the  same  basic  principles  apply  to  all  to  a 
large  extent,  DCT,  etc. 

This  became  very  obvious  when  my  “sister”  lab  ATVL  (mainly  for  receiver  research)  made  a  cost  com¬ 
parison  for  compression  encoders  for  WP-3. 

In  addition  to  the  attached  chan,  I  have  the  following  comments: 

1.  General  extensibility 

•  G1  has  limited  color  resolution. 

2.  Computer  rating 

1. DSC 

2. ADHDTV 

3. GI 

4. CCDC 

3.  General  flexibility 

1. DSC 

2.  ADHDTV,  the  MPEGl  frame  structure  might  hinder  future  developments. 

3.  GI  too  much  emphasis  on  NTSC 

4.  (XlDC  only  considers  TV  transmission. 

Finally  I  have  attached  here  two  charts,  which  I  have  used  for  my  general  discussions  for  comparing  me  . 
basic  (video)  features  of  the  proposed  systems. 


Sincerely, 


JKH/jay 

November  2, 1992 
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EXECUTIVE  SUMMARY 

The  objective  of  Planning  Subcommittee  Working  Party  4  (PS-WP/4)  was  to 
study  and  make  recommendations  regarding  the  relationship  of  terrestrial  advanced 
telewsion  systems  to  alternative  media,  applications  and  standards.  It  was  also  the 
objective  to  Investigate  approaches  for  growth  paths  to  the  future  while,  at  the  same  time, 
to  support  timely  decisions  on  an  advanced  television  (ATV)  broadcast  system  with 
increased  performance  quality  for  the  end  user.  Participants  of  Working  Party  4  have 
addressed  issues  related  to  Interoperability,  scalability  and  extensibility  and  more 
generally,  openness.  Representatives  of  the  broadcast  television,  cable  television,  program 
production,  motion  picture,  computer,  telecommunications,  and  imaging  industries  were 
active  in  this  working  party. 

In  the  prior  year's  effort  (1991),  PS-WP/4  developed  definitions  of  key  terms  such 
as  interoperability,  scalability  and  extensibility.  Based  upon  a  world  becoming  more 
complex  and  richer  in  alternatives  (media,  transmission/distribution,  preseritations),  the 
working  party  developed  the  concept  of  image  data,  defined  as  the  digital  equivalent  of  the 
video  information  including  image,  sound  and  auxiliary  data  components.  As  a  result, 
PS-WP/4  recommended  the  following  In  its  December  1991  Interim  Report: 

•  Maximize  utilization  of  digital  video  techniques  and  image  data  representation. 

•  Apply  HEADERS  and  DESCRIPTORS  (as  agreed  by  industry  standards 
groups)  as  a  method  of  identifying  image  data. 

Once  the  Systems  Subcommittee  Working  Party  4  (SS-WP/4)  established  the  ten 
selection  criteria,  PS-WP/4  adjusted  its  focus  to  concentrate  on  the  three  criteria  that 
related  to  alternative  media:  Interoperability,  Scope  of  Services  and  Features,  and 
Extensibility. 

An  assessment  of  the  five  proponent  systems  in  reference  to  the  above  three  criteria 
was  made  by  PS-WP/4.  PS-WP/4  developed  an  OSI-Ilke  layered  architectural  model  for 
ATV  to  aid  In  evaluating  the  proponent  systems  along  with  applications  and  performance 
questions  on  these  criteria.  PS-WP/4  employed  a  technical  consultant,  StellaCom,  Inc.,  to 
assist  In  this  analysis.  The  assessments  were  based  upon  information  supplied  by  each  of 
the  proponents  in  (1)  published  form,  (2)  response  to  specific  PS-WP/4  questions  and  (3)  a 
three-day  Interoperability  review  Involving  the  proponents  and  a  Special  Interoperability 
Review  Board  (convened  specifically^  for  evaluation  of  the  proponent  systems  relative  to  the 
three  criteria  and  conducted  In  September  1992).  The  Review  Board  consisted  of  experts 
across  a  broad  array  of  relevant  disciplines.  The  selected  experts  had  no  relationship  to 
any  of  the  system  proponents.  Results  of  the  Review  Board  evaluation  weighed  heavily  in 
the  PS-WP/4  conclusions  and  recommendations. 
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PS-WP/4  has  identified  a  number  of  characteristics  that  contribute  significantly  to 
interoperability,  Scope  of  Services  and  Features,  and  Extensibility.  These  are  based  on 
needs  and  desires  exhibited  by  alternative  media  advocates,  not  only  for  the  delivery  of 
terrestrial  broadcast  television  programming  but  also  for  other  delivery  approaches  and 
applications  relating  to  computing,  communications,  motion  pictures  and  imaging.  In 
relative  order  of  importance,  these  characteristics  are: 

•  An  all-digital  implementation  based  on  a  layered  architecture  model 

•  The  use  of  universal  headers  and  descriptors  (as  agreed  by  industry  standards 
group,  for  example,  SMPTE) 

•  Transmission  of  the  signal  in  progressive  scan  format 

•  Use  of  a  flexible,  packet  data  transport  structure 

•  Viewer  transparent  channel  re-allocation  (limited  picture  and  sound  while  most  of  the 
channel  capacity  is  devoted  to  data  transmission  for  conditional  access  addressing 
or  other  purposes) 

•  Ability  to  implement  lower-performance,  low-cost  ATV  receivers  (comparable 
price/performance  options  to  current  NTSC  receivers) 

•  Ability  to  implement  low-cost  ATV  consumer  VCR 

•  System  architecture  and  implementation  that  will  allow  Improvements  and  extensions 
to  be  incorporated  as  technology  advances  while  maintaining  backward  compatibility 

•  Square  pixels  or  at  least  the  option  to  select  square  pixel  presentation 

•  Compatibility  with  relevant  international  standards  or  commitment  to  this  objective 

•  Easily-implementable  and  user-accessible  "still/motion  multi-window  transmission" 

The  PS-WP/4  assessment  and  evaluation  of  the  proponent  systems  shows  some 
significant  differences  under  the  three  criteria.  Further,  ail  proponent  systems  need 
improvement  on  one  or  more  of  the  listed  characteristics  to  achieve  a  desirable  degree  of 
interoperability,  extensibility  and  scope  of  services  and  features.  It  is  recommended  that 
the  Special  Panel,  the  Advisory  Committee  and  the  FCC  take  these  differences  into 
account  In  the  process  of  selecting  an  ATV  standard.  Furthermore,  selection  of  a  system 
that  incorporates  interoperability  features  not  included  in  the  system  as  submitted  for 
testing  requires  verification  and/or  testing.  The  system  submitted  for  field  testing  should 
also  include  such  features. 
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BACKGROUND 

The  objective  of  Planning  Subcommittee  Working  Party  4  (PS-WP/4)  was  to  study 
and  make  recommendations  regarding  the  relationship  of  terrestrial  advanced  television 
systems  to  alternative  media,  applications  and  standards,  It  was  also  the  objective  to 
investigate  approaches  for  growth  paths  to  the  future  while,  at  the  same  time,  supporting 
timely  decisions  on  an  advanced  television  (ATV)  broadcast  system  with  increased 
performance  quality  for  the  end  user.  Participants  of  Working  Party  4  have  addressed 
issues  related  to  Interoperability,  scalability  and  extensibility  and  more  generally,  openness. 
Representatives  of  the  broadcast  television,  cable  television,  program  production,  motion 
picture,  computer,  telecommunications,  and  imaging  industries  were  active  in  this  working 
party. 


In  the  prior  year's  effort  (1991),  PS-WP/4  developed  definitions  of  key  terms  such  as 
interoperability,  scalability  and  extensibility.  Based  upon  a  world  becoming  more  complex 
and  richer  In  alternatives  (media,  transmission/distribution,  presentations),  the  worWng 
party  developed  the  concept  of  image  data,  defined  as  the  digital  equivalent  of  the  video 
information  including  image,  sound  and  auxiliary  data  components.  As  a  result,  PS-WP/4 
recommended  the  following  in  its  December  1991  interim  Report: 

•  Maximize  utilization  of  digital  video  techniques  and  image  data  representation. 

•  Apply  HEADERS  and  DESCRIPTORS  (as  agreed  by  industry  standards 
groups)  as  a  method  of  identifying  image  data. 

In  1992,  PS-WP/4  developed  an  ISO-like  layered  architectural  model  for  ATV, 
consisting  of  seven  layers  as  shown  below: 
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The  application  of  this  architectural  model  in  the  working  party’s  analyses  is  shown  in 
the  Digital  Video  Reference  Model  For  interoperability  in  the  following  figure. 
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PS-WP/4  examined  relationships  of  terrestriai  advanced  television  sy^ems  to 
alternative  media,  applications  and  standards  In  the  broad  context  of  recognized  and 
anticipated  advances  in  computing,  communications  and  imaging  technology.  Weighting 
the  value  (e.g.,  market  and  social  benefit)  of  Interoperability  or  extensibility  other  than  in 
established  markets  (e.g.  terrestrial  broadcast  or  cable  television  programming)  was  difficult 
and  arguable.  Formal  positions  from  affected  industries  were  not  available. 

Throughout  PS*WP/4's  investigation,  few  disagreed  that  the  U.S.  FCC  ATV  dedsion 
should  endure  over  some  significant,  foreseeable  technology  horizon  which  should  be  at 
least  several  decades. 

Furthermore,  few  disagreed  that  over  this  technology  horizon,  concepts  that  are 
today  limited  by  cost  considerations  for  the  consumer  market  will  become  reality  in  the 
future.  These  concepts  Include,  at  the  least,  progressive  scan  (for  capture  and 
presentation)  and  motion  Image  data  delivery  over  packetized  wideband  networks. 

PS-WP/4  believes  that  the  FCC  ATV  decision  must  be  considered  as  defining  a 
starting  point  and  assuring  a  migration  path  to  the  future,  rather  than  as  defining  an  end 
point. 

Assessment  of  Proposed  ATV  Systems 

PS-WP/4  was  assigned  the  task  of  assessing  the  Interoperability  of  the  proponent 
systems  with  media  other  than  terrestrial  broadcasting.  In  addition,  the  systems  were  to  be 
evaluated  with  regard  to  extensibility  and  scope  of  services  and  features.  A  preliminary 
assessment  was  written  in  early  August  by  StellaCom,  Inc.  based  on  published  Information 
available  at  the  time.  This  report  Included  Interoperability  questions  that  were  to  be 
answered  by  the  proponents  in  a  planned  Interoperability  Review  in  September  1992. 

Some  25  generic  questions  on  Interoperability,  extensibility,  and  scope  of  services 
and  features  were  compiled  for  all  proponents.  An  additional  seven  to  ten  system-specific 
questions  were  also  addressed  to  each  proponent.  Interoperability  assessments  were 
made  with  regard  to  cable  TV,  digital  technol^y,  headers/descriptors,  NTSC,  film, 
computers,  satellites,  packet  networks.  Interactive  systems,  format  conversion  and 
scalability.  The  Initial  use  for  ancillary  data,  audio,  data,  text,  captioning,  encryption, 
addressing,  capability  for  a  very  low-cost  receiver  with  lower  performance  and  VCR 
capability  were  assessed  as  services  and  features.  Extensibility  to  no  visible  artifacts 
regardless  of  the  detail  and  motion  in  a  scene,  to  studio  quality  (editable),  to  higher 
resolution  such  as  VHDTV  and  UHDTV  and  provision  for  future  compression 
enhancements  were  examined. 

The  proponents  answered  all  questions  In  writing  at  the  Interoperability  Review  on 
September  23-25.  They  were  given  additional  time  to  submit  any  proposed  Interoperability 
Improvements  to  their  systems.  A  revised  Interoperability  assessment  was  then  generated 
Including  the  updated  material.  The  proponents  were  given  the  opportunity  to  critique  the 
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assessment.  The  assessments  were  then  revised  and  abbreviated  to  five  to  six  pages  for 
each  proponent  to  be  submitted  to  the  ACATS  SS-WP/4  Special  Panel.  The  result  is 
attached  as  Reference  1 . 


Interoperability  Review 

An  interoperability  Review  was  held  with  the  ATV  proponents  to  focus  on 
interoperability,  extensibility  and  scope  of  services  and  features.  Two-hour  presentations 
were  made  by  each  proponent  followed  by  1/2  hour  of  questions  from  the  Review  Board  of 
multi-industry  experts  and  1/2  hour  by  other  PS-WP/4  members. 


The  Review  Board  members  were: 

Jules  Bellisio 
Jules  Cohen 
Gary  Demos 
Jack  Fuhrer 
Branko  Gerovac 
Jukka  Hamalainen 
George  Hanover 
Robert  Hopkins 
Ed  Horowitz 
Mike  Liebhold 
Craig  Tanner 
Arpad  Toth 
Tony  Uyttendaele 


Bellcore 

Consuttant/MSTV 
Demographx 
Hitachi  America 
DEC 

Matsushita  Applied  Research  Lab 
Electronic  Industries  Association 
ATSC 
Viacom 

Apple  Computer 
CableLabs 
Eastman  Kodak 
CapCities/ABC 


The  proponent  presentations  in  the  Interoperability  Review  gave  strong 
consideration  of  interoperability,  extensibility  and  scope  of  services  and  features.  The 
proponent  ATV  system  descriptions  contained  ewdence  of  effort  toward  enabling 
interoperability  with  alternative  media.  The  written  material  distributed  at  the  meeting  is 
found  in  Reference  3:  Minutes  of  the  PS-WP/4  Interoperability  Review,  September  23-25, 
1992. 


Additional  information  was  submitted  by  some  proponents  on  October  1. 1992  and 
can  be  found  in  Reference  4:  Minutes  of  the  PS-WP/4  meeting,  November  5.  1992. 
Evaluations  from  the  Interoperability  Review  were  submitted  to  StellaCom,  Inc.  by  members 
of  the  Review  Board  for  compilation  and  analysis.  As  much  as  possible,  the  evaluations 
were  based  on  "what  the  evaluator  expects  the  proponents  will  deliver  commerdally".  The 
proponents  were  also  invited  to  evaluate  the  proposed  systems.  The  results  from  these 
evaluations  are  attached  as  Reference  2.  It  contains  summaries  of  the  ratings  by  the 
proponents,  the  Review  Board  and  the  ratings  by  individual  experts. 


6 


PS-WP/4  FINAL  REPORT 


Conclusions 

Based  on  assessments  of  the  five  proposed  ATV  systems  and  the  Interoperability 
Review  conducted  on  September  23*25, 1992,  PS-WP/4  reached  the  following  conclusions 
with  respect  to  the  selection  criteria  of  Interoperability,  Scope  of  Services  and  Features,  and 
Extensibility.  See  Reference  2  for  numerical  ratings  and  detail. 

Overall 

During  the  assessment  and  review  process,  all  proponents  gave  strong 
consideration  to  meeting  the  three  selection  criteria  and  presented  evidence  to  show  how 
their  systems  met  or  could  meet  the  criteria. 

The  four  all-digital  systems  (AD-HDTV,  CCDC,  DIgICipher  and  DSC-HDTV)  were 
found  to  be  superior  to  the  Narrow-MUSE  system  with  respect  to  Interoperability  and 
Extensibility  on  all  but  two  or  three  characteristics  where  Narrow-MUSE  was  approximately 
equal  to  the  all-digital  systems.  With  respect  to  Scope  of  Services  and  Features,  the  all- 
digital  systems  also  ranked  better  than  Narrow-MUSE  but  the  difference  was  smaller. 

Interoperability 

•  All  five  systems  were  judged  to  be  easily  deliverable  by  satellite  or  Cable  TV. 

•  The  four  all-digital  systems  were  judged  to  interoperate  well  with  NTSC. 

•  The  four  all-digital  systems  were  judged  to  Interoperate  well  with  film  with  CCDC 
ranking  better  than  the  other  three  systems. 

•  Interoperability  with  computers  was  judged  to  be  more  difficult  than  with  satellites. 
Cable  TV,  NTSC  and  film.  DSC-HDTV  ranked  slightly  better  than  CCDC.  with  AD- 
HDTV  next,  and  DigiCipher  ranked  somewhat  worse. 

•  Progressive  scan  and  square  pixels  are  Important  for  computer  and  other  Image 
applications  (rotation,  scaling,  still-image,  LCD  displays.  ...).  Computer  Industry 
applications  desire  greater  than  70  Hz  display  rates  although  no  proponent  system 
offers  this  feature. 

•  Ail  of  the  proponents  of  all-digital  systems  now  recognize  that  the  use  of  headers 
and  descriptors  Is  a  critical  enabling  concept  for  assuring  ATV  flexibility  In  the  future. 
Although  these  four  proponents  propose  to  Include  headers  and  descriptors  In  their 
systems,  only  AD-HDTV  had  rts  final  proposal  for  headers  and  descriptors  fully 
Implemented  at  the  time  the  system  was  tested  by  ATTC,  and  this  system  received 
the  highest  rating  on  this  characteristic.  There  is  need  for  definitive  work  on  an 
Industry  standard  for  headers/descriptors.  The  SMPTE  work  is  a  candidate. 
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•  All  of  the  proponents  of  all-digital  systems  now  recognize  that  the  use  of  a 
packetized  data  structure  combined  with  headers  and  descriptors  is  important  for 
future  network  and  telecommunications  applications  of  ATV.  Although  these  four 
proponents  propose  to  use  a  packetized  data  structure,  only  AD-HDTV  had  Its 
current  proposal  for  a  packetized  data  structure  implemented  at  the  time  the  system 
was  tested  by  ATTC  and  this  system  received  a  significantly  better  rating  than  the 
others  on  this  characteristic.  The  DSC-HDTV  system,  as  tested,  incorporated  a  form 
of  packet  structure  using  fixed-length  data  segments. 


•  Interoperability  with  interactive  systems  is  an  important  consideration  for  future 
applications.  The  four  all-digital  systems  were  rated  approximately  equal  with 
respect  to  this  characteristic. 


•  With  respect  to  format  conversion,  the  four  all-digital  systems  were  rated 
approximately  equal. 


•  Scalability  Is  an  important  characteristic  In  that  It  will  permit  user  price/performance 
tradeoffs  at  both  the  display  (picture)  level  and  at  the  network  (transport)  level.  All  of 
the  systems  are  scalable  at  the  picture  level,  permitting  display  quality  and  cost  to  be 
traded  over  a  wide  range.  None  of  the  systems  achieve  the  degree  of  scalability  at 
the  transmission  level  that  would  permit  similar  trade-offs  In  future  'bandwidth-on- 
demand"  network  environments  and  low-cost  receivers.  However,  the,  AD-HDTV 
and  DSC-HDTV  systems  both  employ  a  prioritized  data  structure  at  the  transmission 
level  that  provides  a  limited  degree  of  scalability. 


Scope  of  Services  and  Features 

•  With  respect  to  the  nine  characteristics  considered  for  this  selection  criterion  (initial 
use  for  ancillary  data.  Audio,  Data,  Text,  Captioning,  Encryption,  Addressing,  Low- 
cost  receiver  and  VCR  capability),  there  were  no  statistically  significant  differences  in 
the  ratings  of  the  four  all-digital  systems. 

•  The  use  of  a  packetized  data  structure  with  headers  and  descriptors  as  discussed 
above  provides  important  system  flexibility  in  meeting  this  selection  criterion.  It 
allows  channel  capacity  to  be  dynamically  assigned  to  services  and  features  as 
needed,  and  K  permits  new  services  and  features  to  be  implemented  in  the  future, 
while  maintaining  backward  compatibility  with  existing  ATV  receivers.  Computer, 
telecommunications  and  cable  TV  industry  experts  place  a  high  value  on  "viewer 
transparent  channel  data  reallocation",  permitting  instant  reallocation  of  the  majority 
of  the  channel  capacity  to  data  transmission  while  maintaining  audio  transmission 
and  limited  video  transmission,  e.g.,  a  still  image  or  motion  video  (possibly  at 
reduced  quality)  in  a  window  or  windows  on  a  still  background. 
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•  With  respect  to  low-cost  receivers,  this  characteristic  was  modified  during  the  review 
to  mean  the  ability  to  implement  an  ATV  receiver  with  reduced  performance  that 
would  sell  at  a  price  comparable  to  current  NTSC  receivers.  SS-WP/3  is  addressing 
this  issue,  and  current  indications  are  that  there  is  no  significant  difference  among 
the  five  proposed  systems,  and  they  were  rated  essentially  equal  virith  respect  to  this 
characteristic.  Further,  the  current  view  is  that  signal-processing  electronics  cost  will 
ultimately  become  an  insignificant  portion  of  the  total  receivers'  cost. 

•  The  use  of  a  packetized  data  structure  with  headers  and  descriptors  combined  with  a 
prioritized  data  structure  at  the  transmission  level  is  an  advantage  for  accomplishing 
trick  modes  in  digital  VCRs.  Special  flags  could  allow  easy  and  low-cost 
implementation  for  trick  play  and  editing. 

•  The  use  of  a  packetized  data  structure  with  headers  and  descriptors  allows  flexibility 
in  implementing  audio  services.  Independently  coded  audio  channels  can  be  used 
when  appropriate  (e.g.,  for  simultaneous  multiple  languages  during  a  newscast  or  a 
sportscast),  and  compositely  coded  audio  can  be  used  to  provide  five-channel 
"surround-sound"  for  major  movies.  The  required  data  capacity  will  vary  depending 
on  the  type  of  coding  and  the  number  of  channels. 


Extensibility 

•  Wrth  respect  to  the  six  characteristics  considered  for  this  selection  criterion 
(Extensibility  to:  no  visible  artifacts;  studio-quality  data  rate;  higher  resolution; 
VHDTV;  and  UHDTV;  and  Provision  for  future  compression  enhancements),  there 
were  no  statistically  significant  differences  in  the  ratings  of  the  four  all-digital 
systems. 

•  The  use  of  a  packetized  data  structure  with  universal  headers  and  de^riptors  as 
discussed  above,  provides  important  flexibility  in  meeting  this  selection  criterion.  For 
example,  if  a  higher  data  rate  channei  is  used  to  distribute  programming  to  network 
affiliates,  additional  packets  (with  appropriate  headers  and  descriptors)  could  provide 
higher-quality  images  for  post-production  processing  by  the  affiliate.  For  simple 
pass-through,  these  packets  would  be  stripped,  and  the  remaining  data  stream  re- 
clocked  to  the  terrestrial  broadcast  data  rate. 

•  The  proponents  of  the  four  all-digital  systems  have  all  made  provision  to  Incorporate 
future  improvements  In  their  video  compression  systems  that  will  be  backward 
compatible  with  existing  ATV  receivers.  Examples  of  possible  improvements  include 
more  accurate  motion  estimation,  better  psycho-physical  decision  rules,  improved 
buffer  management  rules,  etc. 
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System  Specific  Comments 

During  the  assessment  and  review  conducted  by  PS-WP/4,  the  proponents 
described  how  they  proposed  to  meet  the  requirements  of  the  three  selection  criteria 
against  which  they  were  being  evaluated.  In  some  cases,  these  proposals  require  changes 
to  the  system  hardware  and  software  from  that  which  was  tested  at  ATTC.  The  effects 
these  changes  could  have  on  the  results  of  the  ATTC  and  ATEL  evaluations  of  the  systems 
are  unknown.  In  other  cases,  certain  features  or  characteristics  were  present  in  the  system 
when  it  was  tested  by  ATTC,  but  were  not  exercised  during  the  tests.  Where  changes  are 
required,  rt  must  be  verified  that  these  changes  will  not  affect  the  results  obtained  by  ATTC 
and  ATEL.  Where  the  features  were  not  exercised,  it  must  be  verified  that  they  work  when 
exercised. 

All  proponent  systems  need  some  improvement  in  interoperability,  scalability  and 
extensibility.  For  example: 

AD-HDTV 

-  verification  of  the  progressive-scan  and  square-pixel  migration  strategy 

CCDC 

-  implementation  of  universal  header/descriptor  concept 

-  implementation  of  packetized  data  structure 

DigiCipher 

-  Implementation  of  universal  header/descriptor  concept 

-  development  &  verification  of  a  progressive-scan  and  square-pixel 
migration  strategy 

-  implementation  of  packetized  data  structure 

DSC-HDTV 

-  implementation  of  universal  header/descriptor  concept 

-  complete  the  implementation  of  a  fully  packetized  data  structure 

Narrow-MUSE 

-  commitment  to  digital  implementation 

-  implementation  of  universal  header/descriptor  concept 

-  implementation  of  packetized  data  structure 

-  development  &  verification  of  a  progressive-scan  and  square-pixel 
migration  strategy 
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RECOMMENDATIONS 

Based  upon  the  needs  and  desires  exhibited  by  the  alternative  media,  not  only  for 
the  delivery  of  terrestrial  broadcast  television  programming  but  also  for  other  delivery 
approaches  and  applications  relating  to  computing,  communications,  motion  pictures  and 
imaging,  PS-WP/4  has  identified  a  number  of  characteristics  that  contribute  significantly  to 
Interoperability,  Scope  of  Services  and  Features,  and  Extensibility.  In  relative  order  of 
Importance,  these  characteristics  are: 

•  An  all-digital  implementation  based  on  a  layered  architecture  model 

•  The  use  of  universal  headers  and  descriptors  (reference  ACATS  Interim 
Report,  March  1 992) 

•  Transmission  of  the  signal  in  progressive  scan  format 

•  Use  of  a  flexible,  packet  data  transport  structure 

•  Viewer-transparent  channel  re-allocation  (limited  picture  and  sound  while 
most  of  channel  capacity  devoted  to  data  transmission  for  conditional  access 
addressing  or  other  purposes) 

•  Ability  to  implement  lower-performance,  low-cost  ATV  receivers  (comparable 
price/performance  options  to  current  NTSC  receivers) 

•  Ability  to  implement  low-cost  ATV  consumer  VCR 

•  System  architecture  and  implementation  that  will  allow  improvements  and 
extensions  to  be  incorporated  as  technology  advances  while  maintaining 
backward  compatibility 

•  Square  pixels  or  at  least  the  option  to  select  square  pixel  presentation 

•  Compatibility  with  relevant  international  standards  or  commitment  to  this 
objective 

•  Easily-implementable  and  user-accessible  "still/motion  multi-window 
transmission" 

The  PS-WP/4  assessment  and  evaluation  of  the  proponent  systems  shows  some 
significant  differences  under  the  three  criteria.  Further,  all  proponent  systems  need 
improvement  on  one  or  more  of  the  listed  characteristics  to  achieve  a  desirable  degree  of 
interoperability,  extensibility  and  scope  of  services  and  features.  It  is  recommended  that 
the  Special  Panel,  the  Advisory  Committee  and  the  FCC  take  these  differences  into 
account  in  the  process  of  selecting  an  ATV  standard.  Furthermore,  selection  of  a  system 
that  incorporates  Interoperability  features  not  included  in  the  system  as  submitted  for 
testing  requires  verification  and/or  testing.  The  system  submitted  for  field  testing  should 
also  include  such  features. 
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SMPTE  HEADER/DESCRIPTOR  TASK  FORCE: 

FINAL  REPORT 


Engineering  Committee  Report 


SMPTE  Header/Descriptor  Task  Force:  Final  Report 

January  3,  1992 


This  report  of  the  SMPTE  Task  Force  on  HeadersIDescriptors  is  an  approved 
document  of  the  SMPTE  Standards  Committee  and  is  made  available  for  infbr* 
motion,  as  it  contains  valuable  proposals  concerning  the  development  of  digital 
imaging  and  video  systems  and  for  standardization  of  certain  of  their  aspects  that 
will  be  of  interest  generally  The  standardaation  aspects  of  the  report  vrill  be  fitrther 
considered  under  the  normal  processes  of  the  SMPTE  for  the  creation  and  approv¬ 
al  of  engineering  documents,  which  includes  the  opportunity  for  fitrther  comment 
and  for  public  review  prior  to  their  final  acceptance.  Persons  vhshing  to  aatvefy 
participate  in  the  dev^pment  of  these  standards,  including  attendance  at  Hbrk- 
ing  Croup  meetings  and  ballot  response,  may  contact  the  Engineering  Dept  of  the 
SMPTE  It  should  be  noted  that  engineering  documents  arising  from  the  contents 
of  this  report  may  differ  signifieanth  from  its  recommendations,  and  caution  is 
suggested  in  the  use  of  this  report  as  the  basis  of  design  or  implemeruatiort 


"  1.0  Introductioa 

Tie  Task  Force  on  Header/Desenpiors 
has  considered  the  questions  posed  in  its 
scope  of  committee  work  and  makes  the 
following  flnal  repon  and  recommenda¬ 
tions  to  the  Standards  Committee.  The  re¬ 
pon  begins  with  a  diseussion  of  the 
general  objectives  of  the  header/desc^ 
tor.  and  then  presents  more  spedfie  objec¬ 
tives  seleaed  by  the  Task  Force  as  it 
developed  two  aitemative  implementa¬ 
tions. 

Both  of  the  proposed  implemenutions 
could  suppon  new  SMPTE  standards,  and 
are  described  in  tome  detail  here.  The 
‘ASN.l  Implemenution'*  it  struciured 
using  only  Abstract  Syntax  Noution  I 
(ASN.IU  an  existing  and  evolving  ISO/ 
CGTT  sundard  prindpally  used  in  the 
computer  industry.  The  ~CMpact  Impie- 
mentation'  it  d^gned  to  minimise  the 
number  of  bits  allocated  to  the  header/de- 
tCTtptor  function,  but  also  pennia  option- 
ai  use  of  the  ASN.J  noution  later  in  the 
header/desortptor  for  further  extensibility. 
Both  impiemenutions  perform  essentially 
identical  functions. 

Appendices  A  and  B  pKsent  illustrative 
approaches  to  the  de^  of  transport 
headers  and  header^eebding  software, 
respectively.  Transport  headers  are  de¬ 
signed  to  address  ceruin  difficult  dau 
transport  problems.  Appendix  C  lists  the 
offidaJ  task  force  members  as  of  January 
3. 1991 

In  view  of  (1)  the  great  imporunce  to 
industry  and  iu  customers  of  the  capabili¬ 
ties  provided  by  the  header/deschpton 
described  below,  and  (2)  the  degree  to 


which  these  two  possible  impiemenutions 
satisfy  the  objectives  established  at  the 
outset  for  beader/descriptets.  the  Task 
Force  recommends  that*  the  Standards 
Committee  arrange  for  the  preparation  of 
one  or  rwo  new  standards  for  digital  head- 
er/descriptors  based  on  either  the  ‘Com- 
paa”  or  the  “ASKl"  Impiemenutions 
described  beiow.  or  on  a  combination 
thereof. 

2.0  G«aeral  Objectives 

The  header/descriptor  task  force  was  di- 
reaed  to  consider  header/descriptor  ar¬ 
chitectures  and  implementations 
appr^riate  for  the  emer^g  digital  high- 
definition  television  (HDT^  and  high- 
resolution  system  (HRS)  tndustries.  The 
primary  design  objeetiva  of  the  task  force 
are: 

•  Universality  All  image  and  other  dau 
streams  should  be  labeled  so  that  signals 
can  be  shared  across  systems  and  applica¬ 
tions  with  minimal  degndation  or  confu¬ 
sion:  the  header/descriptor  should 
therefore  uniquely  identify  the  encoding 
scheme  emplo^d  and  how  the  dau  is  to 
be  interpreted. 

•  Longevi^  The  header/descriptor 
should  provide  a  number  of  potential 
identification  codes  adequate  to  serve  for 
decades,  and  preferably  centuries:  this  im¬ 
plies  that  specific  encoding  identifiers, 
ona  assigned  and  registered,  should  not 
be  reassigned  or  redefined.  The  header/ 
descriptor  should  also  faciliuu  loogevity 
for  equipment  and  media  of  all  types. 

•  Eaensibility  To  fodliuu  ser^  en- 
hancemem  and  innoution.  and  to  pro- 


mou  kmgmty  of  both  equipment  an 
recorded  signals,  the  huder/dcscripK 
should  accommodate  teduKfogieal  ad 
vanees  in  either  equipment  or  recordc. 
sipals  with  mmtmal  risk  of  obmietiag  as 
isdng  components,  inhastruetuic,  and  aae 
dia  collections. 

•  InuropfttbUity  The  header  sfaouk 
permit  optimal  sharing  of  dau  stream 
across  dau-generation.  carrier,  and  cqi^ 
ment  technologies  and  services  in  a  variet. 
of  error  eiwironmenu  and  should  penni 
all  equipment  and  applications  to  sueeesi 
fully  ignore  encrypted  or  otherwise  dett 
crately  inaccessible  data. 

•  Cost/performanee  effeaivtness.  Tb 
header/descriptor  should  permit  use  e 
both  low.cost  equipment  as  well  as  anr 
expensive  high-performance  equipmcni 
the  header/descriptor  should  aim  aceou 
modate  inexpensive  equipment  incapabt 
of  decoding  all  possible  dau  streana 
Eeonomy  and  simplieity  through  flexfoOir. 
and  scalability  of  the  k^  performance  pa 
iitneters  should  also  be  supporuble. 

•  Cempaemest  The  header/dcaaipte 
should  be  economical  in  iu  uriKation  o 
biu  and  should  typically  comprise  a  ncgli 
gible  fraction  of  the  underlying  dbu 
stream. 

•  Rapid  capture.  Much  video  and  othe 
serial  data  is  intercepted  midstream,  aud 
as  when  users  switch  to  a  new  channel  am 
therefore  the  header/descriptor  shouic 
permit  rapid  header  idendfiotion,  ade- 
^au  to  meet  the  needs  of  all  appUca- 
tfons. 

•  Editabiliiy  Common  editing  and  paiB> 
ing  operations,  such  as  splicing,  append¬ 
ing.  replacing,  inserting,  cropping,  and 
overiayv  should  be  supporuble  ^  the 
header/descriptor  architecture  without 
necessarily  requiring  decoding  and  encod¬ 
ing  of  the*  dau  stream  hselL 

3.0  Specific  Hcader/Descriptor 
Objectives 

To  meet  the  general  objectives  aimma- 
rued  above,  the  Task  Force  **** 

following  compact  set  of  specific  objec- 
tives  which  are  met  by  both  iiuplementa- 
tions  described  later.  The  header  and 
descriptor  are  defined  here  separauly. 

3.1  Specific  Header  Objectives 
The  specific  objectives  of  the  header  are 
to: 
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•  Identify  by  number  the  encoding 
ktandard  employed  by  the  atuched  block 
ui  data. 

•  Sper.f)'  the  length  of  that  block  of 
data,  so  that  e^ipment  of  any  epoch  can 
successfully  skip  untnteresung  blocks  of 
data  or  data  encoded  using  standards  de* 
fined  subsequently. 

•  Indicate  whether  a  readable  descrip* 
tor  follows  the  header. 

•  Permit  usen  to  intercept  dau  streams 
at  random  times,  as  when  switching  ehan> 
aels.  so  that  proper  dau  interpretation  be¬ 
gins  swiftly. 

•  Provide  optional  error-protection  ea* 
pabiliiy.  Oau  generation  entities  may  wish 
to  s^lement  error-proteoion  services 
provi^d  in  subsequent  environments  ex¬ 
perienced  by  that  data,  particularly  when 
those  environments  are  unknown. 

The  task  force  considers  these  anri- 
botes  of  the  header  to  be  the  minimum 
mandatory  set.  realizing  that  additional 
tmporunt  capabilities  can  be  provided  by 
she  descriptor. 

22  Examples  of  Header  Use 

A  Mmpic  example  illustrates  how  these 
minimal  capabilities  for  the  header  utisfy 
the  general  objenives  discussed  above. 
Suppose.'  after  many  years,  some  HDTV 
broadcasters  wish  to  provide  dual-lan¬ 
guage  sound  tracks.  This  capability  could 
be  provided  by  adding  to  the  dau  stream 
Wo^  of  dau  conveying  the  second  lan¬ 
guage.  These  new  blocks  would  be  labeled 
by  a  header  incorporating  a  sundard  num¬ 
ber  not  recognized  by  equipment  prty 
dueed  earlier.  This  older  equipment 
MNild  read  the  header  and  recognize  the 
sundard  identification  number  as  being 
unknown.  It  could  then  observe  the  length 
u>  the  associated  block,  and  skip  over  it  to 
the  next  header. 

Ail  dau  could  be  labeled  by  such  flexi¬ 
ble  headers,  or  only  a  designated  portion 
(e.g..  'auxiliary  data*)  of  a  more  rigidly 
de^ed  larger  video  data  stream.  Note 
that  HDTV'receivers  capable  of  receiving 
only  20  Mbtts/sec  could  not  accommodate 
mereases  except  at  the  expense  of  any 
spare  capacity  previously  reserved  for  ex¬ 
pansion.  or  by  the  broadcaster  reducing 
the  number  of  biu  conveying  video  or 
audio:  in  the  latter  ease  the  origmal  sun¬ 
dard  would  have  to  be  defined  so  as  to 
permit  receiven  to  aeeommodate  any  such 
real-time  video  or  audio  truneuioa  how¬ 
ever. 

2J3  Specific  Descriptor  Objectives 
The  principal  fiinaion  of  the  descriptor 
is  to  convey  additional  infonrution  that 
improves  the  usefulness  of  the  dau  to  the 
user  iu  format  would  be  specified  inde¬ 
pendently  of  the  nandard  employed  for 
the  dau  itself.  Such  optional  auxiUaiy  tn- 
CDrmation  in  the  descriptor  might  indude 
transport  mformaiton  such  as  crypto- 
gniphic.  pnonty.  or  additional  error-pro 


icetton  information,  as  well  as  source  time, 
authorship,  ownership,  restrictions  on  use. 
royalty  payment  information,  explicit  dc- 
scnpiion  of  encoding  or  decoding  pro 
cesses,  intermediate  processing 
performed,  and  other  information  in 
forms  that  could  evolve  over  the  years.  To 
simplify  the  decoding  task,  the  descriptor 
may  alM  contain  an  abbreviated  table  of 
mntenu  and  a  flag  indicating  whether  any 
Wbrmation  has  changed  since  the  pre- 
vi^  descriptor.  The  tanning  of  the  de¬ 
scriptor  would  also  indicate  the  descriptor 
length  so  that  it  might  be  skipped  without 
inttrpreution  if  the  user  chooses.  Option¬ 
al  additiOTul  error  protection  would  be 
available  for  dau  otiginatois  so  desiring  it. 

Specifically,  the  descriptor  could  in¬ 
dude: 

•  A  list  of  sundard-idemifieuion  mim- 
bers,  parameters  of  operation,  text,  and 
algorithms,  in  any  desired  combination. 

•  A  compact  optional  table  of  eonunis 
for  the  des^tor. 

•  A  flag  indicating  whether  changes  oc¬ 
curred  since  the  previous  descriptor. 

•  The  length  of  the  descriptor  so  that  it 
might  be  readily  skipped  if  desired. 

•  Information  indicating  the  number  of 
descriptor  entries  and  their  fomuts  so 
that  they  might  be  properly  interpreted. 

•  Optional  error  protection  for  the  de¬ 
scriptor. 

The  presence  or  absenee  of  a  descriptor 
eoufd  be  indicated  by  one  of  the  bits  con¬ 
tained  in  the  header. 

3.4  Examples  of  Descriptor  Use 

The  use  of  sundard  identification  num¬ 
bers  in  the  descriptor  permits  very  com¬ 
pact  and  flexible  encoding.  For  example, 
one  such  number  might  be  allocated  inter¬ 
nationally  to  each  m^el  number  of  uudio 
ulevision  camera,  so  that  subsequent  im- 
age  processing  can  maximally  improve  tm- 
a^  quality,  compensating  for  any  camera 
idiosyncrasies.  Similar  identifien  could  be 
used  for  different  forms  of  physical  aiu- 
lo^  or  digital  filtering  that  has  been  a^ 
plied  to  the  image  subsequently,  so  that 
user  equipment  might  again  appropriately 
refiiter  the  inuge  in  an  optimum  way  de*- 
pending  on  the  user's  intentioRS.  TU  is 
imporunt  because  performance  for  any 
panicular  display  device  or  audio  ^em  h 
best  when  it  reflects  the  processing  that 
has  occurred  previously. 

Authorship,  twmership,  and  other  such 
information  could  be  eonveyed  by  com¬ 
pact  suodard-identification  aumben,  or 
by  use  of  plain  test  in  English  or  another 
iMguage.  Ceruin  descriptors  might  sim¬ 
ply  be  numbers  indicating  the  settings  of 
certain  twitches  at  the  time  the  signals 
were  generated,  such  as  switches  control¬ 
ling  audio  bass,  treble,  or  volume.  Tbe  de¬ 
scriptor  may  also  include  subroutines  or 
other  encoded  instructions  that  fadliute 
subsequent  processing  or  decoding. 
Standards  numbers  and  parameter 


fields  can  also  be  used  to  suppon  trans¬ 
port-layer  functions,  including  i:Ms:nually 
all  forrns  of  enptography.  statemcr.i  of  the 
relative  priority  of  the  current  dau  block, 
priority  "bidding"  dau  (so  users  can  bid 
for  priority  in  a  free -market  sense),  syn¬ 
chronization  reinforcement  blocks,  and 
other  information,  the  character  of  which 
can  be  defined  over  the  years  as  new  sun- 
dards  identification  numbers  are  assigned 
and  as  new  languages  and  protocols  are 
defined.  Tbe  ineorporation  of  transport 
capabilities  in  this  sundard  should  not 
compromise  those  established  by  other 
layers,  but  would  merely  supplement 
them.  Tie  only  limiution  is  that  such  de¬ 
scriptor  trans^  standards  should  re¬ 
main  robust  if  block  sequences  are 
shuffled  in  another  transpon  layer. 

4.0  Illustrative  Examples  of 
Header/Descriptor  Use 

An  HDTV  broadcaster  could  simply  di¬ 
vide  the  HDTV  signal  into  blocks,  each 
beginntng  with  a  header  of  perhaps  6  to  16 
bytes  in  length.  This  header  would  contain 
the  length  of  each  block,  which  could  be 
fixed  for  ail  time  or  variable,  and  a  unique 
sundards  number  indicating  the  given 
HDTV  encoding  protocol  which  may  also 
be  unchanging  in  the  tniu'al  years.  If  the 
over;the-air  broadcast  standard  is  heavily 
error  protected,  little  additional  error  proh 
section  might  be  added  to  the  header. 
Good  engineering  practice  would  suggesL 
however,  that  the  header  be  tndependent- 
iy  error  protened  using  some  of  the  op¬ 
tions  described  later,  and  that  separate, 
and  powbiy  les  robust,  error  protection 
be  applied  to  the  remainder  of  the  dau 
stream. 

If  the  HDTV  channel  is  defined  so  as  to 
perform  ail  transpon  funaions.  including 
all  synchronization  and  error  eerreetion. 
then  the  header/descriptor  described  here 
might  be  embedded  in  the  transported 
dau  stream.  At  that  level  it  would  prefer¬ 
ably  be  used  to  encapsulate  all  data,  but 
could  be  used  in  an  tnferier  impiemenu- 
lion  to  encapsulate  and  characterize  only 
substream  or  side-channel  data.  In  this  ex¬ 
ample  too.  the  descriptor  might  convey  the 
origins  and  processing  history  of  the  data, 
enabling  future  higher-performance  ty»- 
term  to  employ  pon-filters  opumtztng  the 
quality  of  the  output  onages.  Such  fleidbil- 
hy  could  be  particularly  important  if  the 
output  display  capabilities  of  she  equip¬ 
ment  cnaMed  flexibiliiy  in  frame  rate, 
pixel  interpolation  routim  and  chromtn- 
anee  manipulation.  In  the  ease  of  audio 
signals,  descriptors  could  be  used  in  a  simi- 
Iv  manner  to  erubie  optimum  reproduc¬ 
tion  by  characterizing  mierephone 
piaeement  and  preprocessing. 

A  variety  oS  hcader/deseriptors  plus  as¬ 
sociated  dau  could  be  sequenced  in  any 
side  channel  to  prmnde  a  variety  of  flesble 
delivered  products,  such  as  multiple  audio 
channels.  multiple-langua|e  captioning. 
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home-shopping  order  infonnaiion.  and 
even  entire  TV  signals  encoded  at  lower 
quality  so  that  more  than  one  can  share  a 
single' channel.  In  this  example,  the  ulti¬ 
mate  llexibility  that  could  be  obtained  with 
individual  br^cast  channels  would  be 
heavily  dependent  upon  the  fkadbility  in¬ 
herent  in  the  broadcaster's  ability  to  de¬ 
crease  the  dau  rate  associated  with  the 
initial  broadcast  product  to  accommodate 
possible  growth  in  use  of  the  side-channel 
capability. 

It  is  also  possible  to  send  desatptor  in- 
formation  in  a  separate  blodt  with  its  own 
header,  rather  than  embedding  it  in  the 
same  blocks  as  the  associated  data.  In  this 
case  the  characteritadoo  of  a  signal  pro¬ 
vided  by  a  descriptor  could  remain  un¬ 
changed  for  many  blocks  until  that 
descnptor  information  changes.  In  the 
event  the  transport  layer  is  prene  to  shuf¬ 
fling  the  sequence  of  blocks,  impairing  as¬ 
sociation  of  descriptors  and  use  of  this 
data,  the  inherent  flesbili^  of  this  header/ 
descriptor  system  permits  incorporation  of 
sequence  numbers  in  blocks  so  that  the 
receiving  entity  can  properly  sequence 
them.  Such  separation  of  descriptor  infor¬ 
mation  into  separate  blocks  also  simplifies 
translation  between  environments  having 
-different  transport  layer  protocols:  any  de- 
•scriptor  elements  provithng  transpon-lay- 
er  fonetionality  can  be  added  or  deleted 
when  moving  between  such  environments, 
as  desired.  An  option  whereby  spedal 
*iransport  header/deseripiors''  are  pre¬ 
pended  to  blocks  is  described  in  Seoion 
5-17. 

To  the  extent  that  transpon-layer  func¬ 
tionality  might  be  embedded  within  the 
basic  data  block,  it  could  be  harmlessly 
overlaid  by  similar  transpon-layer  func¬ 
tionality  in  other  system  elemenu  without 
loss.  Thus  the  great  flexibility  inherent  in 
the  header/desenptor  architecture  pro¬ 
posed  here,  ineiuding  its  ability  to  perform 
multiple  funaions  in  a  multilayer  ISO  en¬ 
vironment.  should  not  be  a  handicap,  and 
may  in  some  applications  be  an  imponant 
advvuge. 

Funhermore.  this  header/deseriptor 
structure  permits  efficient  utilization  of 
the  standards  activities  of  a  wide  variety  of 
national  and  international  orpnizations. 
Every  sundard  developed  by  such  bodies 
for  eharaaerizing  or  communicating  digi¬ 
tal  information  can  be  characterized  by  an 
identification  code  that  can  be  conveyed  in 
an  efficient  manner  by  the  header/deserip- 
ter  system  described  here.  Since  the  im- 
plememations  proposed  here  call  for  each 
sundards  authority  to  have  its  own  idemi- 
fiation  number,  such  standards  bodies 
could,  in  this  'ID'  concept,  choose  to  use 
the  very  same  identification  numbers  they 
have  previously  chosen  for  other  purposes. 
Alternatively,  that  authori^  could  choose 
some  ether  simple  one-nxme  mapping 
between  numbers.  At  the  same  time,  such 
standards  an  also  use  this  header/descrip- 


tor  system  as  an  embedded  construct  with¬ 
in  their  own  protocols.  Although  the  lull 
power  of  the  flexible  extetuible  structure 
proposed  here  will  only  become  apparent 
as  it  is  developed  and  improved  over  the 
^ars.  the  basic  architecture  an  be  fixed 
iminediately.  This  would  permit  immedi¬ 
ate  fabnation  and  utilization  of  equip¬ 
ment  based  upon  this  standard. 

5.0  Compact  Header/Descriptor 
Approach 

5.1  Compact  Header/Descriptor 
Architecture 

The  header  is  divided  into  two  parts:  a 
2-byie  'header  key,"  and  the  remainder, 
or  'header  tail."  Among  the  available 
header  ^tions  are  those  without  tails,  cor¬ 
responding  to  a  I-byte  header  conveying 
simple  messages:  32  passible  messages  of 
this  type  are  available  for  future  defini¬ 
tion.  The  most  usual  function  of  the  key. 
however,  is  merely  to  'unlock"  the  tail  by 
providing  information  about  the  format  of 
that  uil. 

The  descriptor  is  divided  into  three 
pans:  a  2-byte  'desertpter  key"  (similar  to 
the  header  key),  a  'core"  of  0  to  8  bytes, 
and  the  'deseriptor  tail."  which  eonv^  a 
Kries  of  numbers  signifying  various  pieces 
of  information. 

5J  Header  Key 

Htader  Kty  Orfottizaiwn 

The  header  key  consists  of  two  4-bit 
fields  and  one  8-bit  field.  The  8-bit  field 
provides  2-bit  error-eeiTection  eapabiliiy 
for  the  key.  and  the  first  4-bit  len^-type 
'LT*  field  determines  the  length  of  the 
header  field  in  the  tail  that  eonuins  the 
length  of  the  block.  A  block  of  dau  is 
defined  as  comprising  the  header,  any  de¬ 
scriptors.  and  the  asaoeiaied  data.  The 
other  4-bit  'ID"  field  normally  determines 
the  number  of  bits  in  the  heaider-tail  field 
devoted  to  indicating  the  sundard  num¬ 
ber  under  which  the  remainder  of  the  dau 
in  the  block,  possibly  induding  the  de¬ 
scriptor.  is  encoded.  Most  descriptots 
would  be  publicly  readable,  however.  He 
ID  field  also  indicates  whether  a  readable 
descriptor  follows  the  header  sometimes 
one  might  wish  to  read  the  descriptor  be¬ 
fore  dedding  whether  to  decode  the  rest 
of  the  dau  block.  The  ID  field  cut  altema- 
tivefy  convey  mesuges  tot  eeruin  4-bit 
eombinatioas  m  the  fim  4-bit  LT  field.  In 
addition  to  specifying  the  length  of  the 
header  tail  fold  devoted  to  specifying 
Mock  length,  the  LT  field  also  determines 
what  level  of  error  protection  is  beiiy  pro¬ 
vided  for  the  header. 

5-2.2  Htadtr  Ktf:  4‘Bil  'Logsh-Type” 
LTHM 

The  primary  purpose  of  the  4-bit  LT 
field  is  to  specify  the  length  of  the  field  in 


the  header  utl  devoted  to  specifying  the 
block  length.  For  6  of  the  16  pouible  com¬ 
binations  proposed  here  (the  'fised- 
lengin"  options)  the  loul  length  of  the 
block  is  prespecified  so  that  do  biu  in  the 
header  uil  are  allocated  to  specifyin 
block  length.  The  ocher  10  proposed  com- 
binaiions  permit  use  of  1. 2. 4.  and  6  byscs 
for  specifying  the  block  length  in  b^  io 
inieger  format:  in  each  case  versions  wUi 
and  without  error-proteetioB  eapabflityfor 
the  header  are  available  to  the 
definition  community.  The  8-M  pnuo- 
don  provided  to  the  header  key  h  always 
presenL  however,  since  the  integrity  of  the 
key  is  crudaL  and  it  lepresentt  aieb  » 
small  pan  of  the  louL 
The  six  proposed  fixed-lengib  opdom 
are  as  foiled 

1.  The  block  is  2  bytes  long  and  the 
mesuge  is  conveyed  by  the  4.bit  !D  fiett 
16  messages  are  possible.  One  of  ihne 
mesuges  could  si|pify  that  the  test  of  She 
header/deseriptor  is  coded  in  ASN.1  (dual 
compact-  and  ASN.l-beader  desoding  ca¬ 
pability  would  be  necessary  for  all  equip 
ment.  however,  but  Appendix  B  mbbsb 
this  might  not  be  burdensome). 

1  Same  as  (1),  except  that  an  addidoih 
ai  16  messages  are  po«bie  (for  a  soul  of 
32  2'byie  options). 

3.  He  block  is  4  bytes  long,  the  mes- 
ttge  consisting  of  the  4  ID  bis  phs  2 
bytes,  a  total  of  20  bits. 

4.  The  Mock  length  is  6  bytes:  a  Ms 
(44-3x8)  are  available,  the  mnwinii 
byte  providing  error  protectioa  for  the  iaai 
4-byte  UL 

5.  The  block  length  is  6  byia  with  56 
biu  of  information  (4*4x8)  being  ivafr 
able,  but  without  additional  error  protec- 
lion. 

6.  He  length  of  the  Mock  is  unknown 
or  irreievanL 

In  each  of  the  foregoing  options,  oeepi 
the  flnt  twa  the  available  bin  cu  be  & 
vided  in  a  yet-to-be-determined  way  be¬ 
tween  those  indicating  the  sundard 
idena&ation  number  and  any  nddhionnl 
mesuge.  He  propoud  SMPTE  standvd 
would  constnin  only  options  4  and  5,  aVm 
eating  the  fim  8  biu  to  designating  the 
sovereign  sute.  so  that  deveiopmem  of 
theu  6-byte  options  can  proceed  without 
mtemational  agreemenL  Dcsignadoa  of 
sundards  bodies  and  sovereign  mes  is 
discusud  ftmher  in  Section  5.14.3. 

The  ten  remaining  proposed  opdooi  for 
the  LT  fieid  provide  for  either  1. 2.  A  or  6 
bytes  in  the  header  uil  to  be  allocated  so 
Reifying  the  block  length  in  bytes,  aiwnjs 
in  inuger  format.  Depending  on  whiefa  of 
theu  ten  options  b  ehoun.  additional  in¬ 
formation  it  also  conveyed  eonceming  the 
level  of  error  proteoiM  for  the  header. 
Theu  propoud  options  are  as  fellows: 

7.  One-byte  field  in  the  header  tnfl 
spedfies  block  length  in  integer  fermw  no 
additional  erior<oneaion  capabiUiy  is 
provided  to  the  header.  Blocks  up  lo  2S6 


SMPTS  Journal,  Juna  1992 


413 


bytes  long  are  available  under  this  option. 

tL  Same  as  (7),  except  an  additional  1 
byte  is  provided  for  header  error  proteC' 
lion. 

4.  Same  as  (7),  except  that  3  bytes  spe¬ 
cie  block  length:  the  maximum  block 
length  here  is  64  Kbytes. 

10.  Same  as  (9),  with  an  additionai  byte 
(inr  header  error  protection. 

11.  Same  as  (7).  ex^t  that  4  bytes  spe¬ 
cif  block  length,  which  can  approach  4 
billion  bytes. 

12.  Sme  as  (11).  except  that  2  bytes  of 
header  error  protection  are  added. 

13.  Same  as  (7).  except  that  6  bytes  spe* 
ciiy  block  length. 

14.  Same  as  ( 13).  except  that  2  bytes  of 
header  error  protection  are  added. 

15>16.  To  be  determined. 

5JJ  Hmdtr  Ktr  ^-Bit  ID  FuU 
When  the  4^‘t  ID  field  is  not  being 
used  to  convey  a  message  using  LT  options 
I  to  6.  then  the  fiist  3  bits  of  the  ID  field 
indicate  one  of  eight  possible  lengths  for 
the  header>tail  field  conveying  the  stan* 
(lard  identification  number,  and  the  fourth 
ID  bit  indicates  whether  a  readable  de* 
scriptor  tbilows  the  header.  These  pro 
ptrsed  eight  laiMength  field  options 
include  1. 3.  4.  and  S-I^e  options.  These 
eight  uiMength  options  proposed  for  the 
ID  field  are  as  follows: 

1.  One*byte  sundard  identification 
number  allcated  internationaily. 

Z  Same  as  (1),  but  providing  for  2S6 
additional  possible  sundards. 

3.  Similar  to  ( 1 ).  except  that  the  stan* 
dard  field  in  the  header  tail  contains  2 
bytes  instead  of  I.  providing  lor  64.(X)0  in* 
teinational  standards. 

4.  Similar  to  ( 1).  but  with  a  3*byte  field 
in  the  header  tail,  providing  for  over  16 
million  international  standards. 

5.  Two  bytes  in  the  header  tail  indicate 
the  sundard  identification  number,  the 
first  byte  indicating  the  sovereign  sute 
(see  Sraion  5.14J). 

6.  Similar  to  (5),  except  that  4  bytes  arc 
available  (one  for  the  sovereign  sute ). 

7.  Similar  to  (5),  except  that  d  bytes  are 
available. 

&.  To  be  determined,  or  reserved  for 
the  distant  future. 

These  first  four  options  provide  1.  2. 
and  3*byte  sundard  identification  num* 
ben  to  some  designated  international 
sundards  body  or  bodies.  Two  versions  of 
the  l*byte  option  are  prcMded  because 
ocherw^  too  few  eombiiutions  would  be 
available.  T)ie  remaining  options  prmhdc 
I  byte  (or  more)  that  identifies  the  t(x«r* 
eign  sute  under  whose  authority  the  re* 
maining  sundard*identifieation>number 
bytes  the  ID  field  have  been  assigned  (1.3. 
or  7  bytes  remain).  Note  that  receiving 
equipment  would  generally  net  distinguish 
between  sovereign  sute  identifien  and 
standard  identification  numbers:  they 
would  be  treated  together  only  as  a  single 


merged  number  that  wa.s  known  or  un* 
kno^. 

S^.4  Header  Tail 

5.14.1  Header  Tail  Orianaation 

The  header  uil  conuins  one  field  for 
tndieating  the  block  length  in  integer  for* 
mat.  one  field  for  the  standard  ide'ntifica* 
tion  number,  and  optionai  fields  for  error 
proieaion.  all  hathng  been  discussed 
above. 

5.14.2  Header  Tail:  Standard  ID 
Organaaiion 

The  Sundard  ID  comprises  two  pans: 
the  sovereign  identification  number,  and 
the  sundard  identification  number,  de* 
achbed  in  Sections  5.2.4J  and  5.2.4.4. 
respectively. 

5.14.3  Header  Talk  Sovereign  State 
Identification 

Eight  bits  is  sufficient  to  designate  the 
authorizing  sovereign  sute.  even  if  the 
number  of  .sovereign  sutes  exceeds  256: 
the  trick  Ls  to  subdivide  cenain  undesig* 
nated  sovereign  sute  identifiers  by  bor¬ 
rowing  bits  from  the  remaining  sundard 
identifier  field.  For  example,  by  deferring 
to  the  United  Nations  the  task  of  defining 
sovereignty,  it  should  be  an  easy  matter  to 
assign  all  present  U.N.  membm  unique 
identification  numbers  in  alphabetical  or¬ 
der.  and  to  assign  the  next  numbers  in 
order  of  membership  admission  to  sutes 
not  replacing  member  sutes  who  are  al¬ 
ready  members.  Once  224  member  sutes 
cxisL  new  U.N.  members  would  share  the 
last  32  numbers.  Within  these  32  sovereign 
sute  "condominium'  identifiers,  an  addi¬ 
tional  3  bits  would  be  borrowed  from  the 
Sundard  Identifier  field,  permitting  8  new 
member  states  per  condominium,  or 
2S6-32*r(32xM)w4tl0  possible  sutes. 
Of  these  .state  designators.  32  would  be 
assigned  to  existing  sundards  bodies.  In 
the  unlikely  event  more  sutes  are  created, 
still  more  bits  can  be  borrmved.  permitting 
unlimited  growth. 

5.14.4  Header  Taik  Standard  Identifier 

The  Sundard  Identifier  would  be  se* 
leoed  by  the  indieated  Sovereign  Sute  or 
International  Sundards  Body  using  exist¬ 
ing  procedures.  To  the  extent  sundards 
already  have  unique  identification  mim* 
ben.  those  same  numben  could  be  used 
here.  To  the  extent  they  do  not.  eaeh  sun¬ 
dards  body  should  map  their  sundards 
into  numbers,  preferably  a  eompan  or 
eonseeutive  scl  Most  new  standards  might 
he  introduced  under  long  numben  associ¬ 
ated  whh  minor  sundards  bodies,  while 
standards  achieving  wide  actcmance 
could  be  renamed  with  short  in... .  ...e 

bits  can  be  berrmved  from  the  Sundard 
Identifier  field  almost  indefinitely,  the  syv 
tern  would  permit  in  certain  cases  (the 
longer  header  options)  for  ever^  itauvid* 


ual  ever  bom  to  become  a  sovereign  defin- 
er  of  standards  under  some  state's 
authority  (the  8-byte  option  provides  each 
sovereign  state  with  70  million  billion 
numben).  In  the  same  way.  a  sundards 
bodv  could  allocate  numben  well  spaced 
nunierically  so  that  their  least  significant 
bits  could  become  "user-allocated  bits." 
functioning  in  lieu  of.  or  in  addition  to.  a 
descriptor. 

Note  that  the  cost  of  this  kind  of  flcxibil- 
itv.  as  well  as  the  other  sons  of  flexibilitv 
described  earlier,  is  essentially  negligibie 
given  the  extensible  nature  of  this  header 
architecture.  That  is.  the  price  of  using 
long  headen  is  paid  principally  by  those 
who  need  thetiu  while  most  usen  will  pre¬ 
fer  and  use  the  shorter  options. 

5.2J  Header  Organaation  Illustrations 

These  options  can  be  represented  pkto- 
rially.  The  16-bit  composition  of  the  head¬ 
er  k^  consists  of  three  pans:  the  4  bits  in 
the  length-type  LT  field  (I),  the  4  biu  in 
the  ID  field  (i),  and  the  8  core  error- 
proteaion  bits  (p);  these  protea  only  the 
key.  The  header  key  always  eonsists  of: 

Ulluiwppppepp. 

If  we  designate  this  2-byTe  key  by  the 
symbol  K.  and  the  bytes  representing  the 
block  length  field  by  L.  the  bytes  repre¬ 
senting  the  standard  identification  num¬ 
ber  by  S.  and  the  header  parity  bytes  by  P, 
then  for  LT  options  1  and  2  we  have  only  AT 
(2  bytes).  For  LT  options  3.  4,  and  S.  we 
have  only  KSS  (4  bytes),  KSSSP  (6  bytes), 
and  KSSSS  (6  bytes),  respeatvely.  In  this 
case  of  extremely  short  blocks,  the  sun¬ 
dard  identification  numbers  SSS  and 
SSSS.  combined  with  the  4  bits  in  the  ID 
field,  would  generally  convey  messages  in 
their  own  right,  and  could  also  be  used  for 
a  variety  of  trannakmal  and  trattsport 
fiinaioru.  LT  option  7  would  permit  the 
following  possibilities:  KL5.  KLSS. 
KLSSS.  KLSSSS.  and  KUSSSSSSS. 
where  the  number  of  bytes  allocated  to  the 
Standard  Identification  number  by  is 
specified  by  the  3  bits  in  the  ID  ^Id  (see 
5.2J).  One  likely  option  for  HDTV  m^t 
be  LT  option  12.  combined  with  ID  option 
I.  r^resented  as  KLLLLSPP  and  eem- 
prising  9  bytes.  The  sundard  number  oon- 
uined  in  field  S  would  be  an  intemational 
sundard.  U  such  an  intemational  sun¬ 
dard  were  devised  to  have  block  lengths  no 
larger  than  64  Kbytes,  and  if  l-bit  error 
eorrection  were  a^quate.  then  a  6-byie 
HDTV  option  is  available:  KLLSP.  Such 
short  5  or  ^byte  header  options  could  of¬ 
ten  be  employed  for  nonvideo  infotma- 
tion.  For  example.  KLSS  (5  bytes)  would 
accommodate  64i)00  possfole  intemation¬ 
al  sundards  employing  dau  up  to  256 
bytes  per  block. 

5J.f  Header  Arekiieentre:  Eguiptnent 
bmpUeations 

Equipment  interpreting  such  headers 
can  be  particularly  simple.  Since  it  normal- 
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N  would  be  reading  a  stream  of  blocks, 
and  should  know  when  a  block  begins,  it 
could  simpiv  jump  to  one  of  the  WK  words 
spenfied  by  the  16-bit  header  key.  these 
«vords  indicating  the  location  of  the  bytes 
specifying  the  block  length.  An  example  of 
an  aJ^nihm  to  perform  these  header  de¬ 
coding  functions  appears  in  Seaion  5.2.8. 
and  an  illustrative  program  coded  in  C  ap¬ 
pears  in  Appendix  B.  Simpler  e^ipment 
might  simply  look  at  the  first  ^  bits  of  the 
LT  field,  fom  which  the  same  infonnation 
can  be  deduced  in  an  error-free  environ- 
ment  Equipment  of  tmennediate  com* 
plesty  can  use  the  key  enor-protection 
^btts  to  an  intermediate  ^gree.  The  ID  bits 
immediateiy  indicate  the  field  where  the 
Standard  Identifier  is  located,  and  equi^ 
ment  should  eompare  this  to  a  list  of  stan¬ 
dards  it  prepared  to  process.  After  any 
indicated  proeeuing.  the  equipment 
moves  directly  to  the  next  header  at  the 
specified  nuniber  of  bytes  along  the  dau 
stream. 

Should  synchronization  be  lost  it  could 
readily  be  recovered  in  most  eases  of  inter¬ 
est  For  example,  in  the  9-byte  HDTV 
header  example  above,  these  9  bytes 
would  probably  never  change  within  a  sin¬ 
gle  broadcast  program.  HDTV  broadcast 
tcceiveis  would  simply  scan  the  dau 
stream  for  that  particular  9-byte  sequence, 
which  could  be  used  as  a  traditional  syn¬ 
chronization  block.  This  would  work  even 
if  multiple  header  types  were  being  inter¬ 
leaved.  Accideniai  synchronizations  (veiy 
rare)  in  the  dau  block  would  be  recogniz¬ 
able  because  the  indicated  false  block 
length  would  probably  not  lead  to  a  valid 
header. 

The  most  direct  use  of  these  header/de- 
seripinrs  would  be  as  a  series  of  bytes  at 
the  Stan  of  each  data  block,  where  the 
data  blocks  are  then  concatenated  in  a 
comnu-less  string  of  bits.  Other  uses  could 
also  be  made,  however.  For  example,  a 
panicular  transpon  scheme  (e.g.  over- 
the-air  HDTV)  might  package  data  blocks 
in  diseontinuous  but  fixed  form  within  a 
larger  data  stream,  this  stream  being  char¬ 
acterized  by  the  embedded  header/de- 
scripton.  A  header/deschptor  could  also 
characterize  (say)  each  (fame  of  an 
HDTV  sequence,  and  the  same  header/ 
descriptor  technique  could  also  be  used 
within  each  (fame  to  communicate  details 
about  hs  intemal  structure.  Such  nesting 
of  header/descriptors  does  not  impair  ^ 
ehronization  much,  although  an  initial 
fobe  synchronization  could  occur  within  a 
larger  block:  this  would  be  quickly  detect¬ 
ed  if  the  presumed  block  ended  without  a 
valid  header  following  it.  Tite  search  for  a 
valid  header  could  then  resume. 

SJ.?  Htadtr  Ankiuaurr  Tmufort 

If  one  wishes  to  incorporate  synchreni- 
ation  augmentation,  extra  error  protec¬ 
tion.  block  prioritization  infonnation.  or 


cryptography  for  the  header  or  descnpior. 
it  must  either  be  defined  at  the  outset  in 
the  header/descripior  definitions,  or  its  in¬ 
corporation  becomes  standard-specific  — 
but  how  could  we  know  the  stan^rd  if  we 
were  not  synchronized  or  error-free,  etc.? 

For  example,  in  high<rror-rate  envi- 
ronmenu  users  may  wish  to  provide  their 
header/descriptors  with  more  synchroni¬ 
zation  infonnation  or  error  protection 
than  is  available  in  the  basic  definitions. 
Fortunately,  much  additional  transport- 
related  infotmation  can  be  conveyed  effi¬ 
ciently  through  repetition  of  short  blocks. 
One  simple  approach  is  to  insert  brief 
bunts  of  shon  2-byu  headers  into  the  dau 
stream,  where  the  synchronization  powen 
of  these  2-byte  header  blocks  are  eumula- 
ttve:  the  greater  the  anddpated  channel 
noise,  the  longer  these  bunts  should  be. 
Although  the  software  of  some  reeeiven 
may  not  be  sophisticated  enough  to  syii- 
chronize  such  bursts  well  this  approach  is 
standard  independent,  and  so  such  soft¬ 
ware  could  be  designed  today.  Such  a 
burst  inserted  anywhere  in  a  sequence  of 
blocks  permits  synchronization  of  the  en¬ 
tire  stream.  Since  2-byie  headen  can  cor¬ 
rect  2  biu  in  16.  synchronization  bursts  fait 
only  when  the  bit-error  rate  continually 
exceeds  -0.1  In  still  higher  noise  environ¬ 
ments  where  ii  is  known  such  2-byie  bursts 
might  be  employed,  ihe  receiver  can  either 
autocorreiate  the  .signal  or  correlate  it  with 
potential  16-bit  synchronization  words: 
this  could  provide  synchronization  for 
nearly  any  bit-error  rate,  provided  the 
burst  length  was  sufficiently  long. 

Similar  issues  arise  when  extra  error 
protection  for  the  header/descriptor  is 
desired.  If  the  nature  of  such  protection  is 
defined  only  in  a  dau  or  sundard  ID  area 
that  cannot  be  read  unless  error  protec¬ 
tion  is  employed,  the  data  is  generally  in¬ 
accessible.  One  option  is  for  the  user  to 
generate,  as  above,  bursa  of  shon  blocks 
that  convey  primarily  the  identity  of  the 
desired  error-protection  scheme  em¬ 
ployed  in  the  longer  data  Mocks.  Such 
standards  could  be  defined  so  that  they 
are  presumed  operative  for  all  following 
data  until  **tumed  olT  by  another  com¬ 
mand.  AJtematively  one  of  the  shon 
blocks  (say  2  bytes)  could  be  employed  to 
indicate  that  error  protection,  cryptogra¬ 
phy.  or  other  such  schemes  were  to  be 
conveyed  in  a  *num>on.  turo-ofl**  foshion: 
a  sepmie  shon  Mock  could  be  used  to 
convey  the  opposite  message.  Such  a  rep¬ 
etitious  Kries  of  shon  blocks  can  also  be 
well  protected  and  synchronized  in  essen¬ 
tially  any  reasonable  error  environment 
(BER  <01)  using  the  existing  defined 
header/  descriptor  options.  In  this  ease  the 
user  would  need  to  know  only  the  defini¬ 
tion  of  the  chosen  ertor-piMeetion  ID 
number.  Such  error-protection  schemes 
and  ptetoeoto  can  be  defined  in  simple 
ways  over  the  years. 

Yet  another  similar  problem  involves 


the  possible  incorporation  of  block  prior¬ 
ity  infonnation.  For  example,  if  some  dau 
pnxessing.  transmissioa  stonge.  or  dis¬ 
play  step  cannot  handle  all  the  data,  the 
originator  of  that  dau  might  wish  to  tell 
the*  user  which  dau  is  more  expendable. 
Yet  the  user  might  wish  to  do  only  mini¬ 
mal  decoding  to  determine  priority.  Al¬ 
though  such  information  would  thm  be 
descnptor-standard-speeific.  one  or  more 
descriptor  sundard  IDs  could  be  defined 
in  such  a  way  as  to  convey  relative  Mock 
priority,  uy  on  a  scale  from  1  to  10.  Suds 
kifotmatton  could  also  be  embedded  id 
the  data  itself,  but  this  would  generally 
require  the  user  to  do  more  decoding  b^ 
fore  dtsearding  any  block.  Although  con¬ 
fusion  couid  arise  because  multiple 
prioriiy-labeiing  descriptor  schemes  n^bt 
arise,  they  would  all  have  standard  aaa^ 
beis  which,  in  prin^ie,  could  be  aceemed. 
Seetien  5.4  describes  another  approech. 
which  can  be  used  in  paralleL 

Although  this  discussion  has  not  been 
exhaustive,  it  does  suggest  that  the  pi» 
poaed  header/descriptor  definidon  bni 
great  flexibility  for  handling  a  verieiy  of 
problems  laced  when  it  must  supplement 
or  provide  transpon-layer  or  other  09 
layer  functions. 

SJJ  Campoa^Hmdtr  Dteodint 

AifprUhm 

Fbr  stmpiieity  here,  we  asntme  dm 
block  has  been  synchronized  and  that  the 
header  key  has  been  eiror<otreGted.  per¬ 
haps  by  using  a  16-bit  dispateh  table.  We 
also  assume  the  equipment  is  provided 
with  a  list  of  sundard  IDs  that  k  kmws 
bow  to  interpret  as  well  as  a  much  shorter 
lin  of  ID  length  fields  cetTcspooding  in 
these  sundards  (possible  values  of  8C.  de¬ 
fined  below). 

1.  Dispatch  on  the  first  byte  (236 
tkms):  return  with  4  integers: 

/  a  iengthof  length  field  (O^.O-^.L 
2.  4.  or  6  bytes) 

J  •  block  length  E  bytes  if/  *  0(2.4L 
or  6  bytes) 

K  a  length  ofID  field  in  tail  (LZ  4. 
Z  or  32  bytes) 

L  a  presence  of  descriptor  (0  or  1) 

Z  If  /  a  0.  let  block  length  B  •  J 

Z  If  /  a  0>.. dispateh  (Table  1:  nn 
presented  here)  onAf.  biaftoSoftbe 
header 

Imeipret  resulting  mesage  and  go  n 
IZ  or  skip  directly  to  12  if  mesuge  un- 

4.  If  /  a  04-.  dispatch  (TaMe  Z  not 
presented  here)  on  Ma  5  to  8  of  header. 

Interpret  resulting  mesuge  and  go  to 
IZ  or  skip  directly  to  12  if  message  en- 

5.  U/ a  0.  read  / bytes  (yieldinf  Mock 
length  5).  surung  at  Mt  17  of  header. 

Z  If  ID  field  length  AT  not  on  liB  of 
known  ID  Geld  lengths,  go  to  IZ 

7.  Read  ID  field  of  length  K  byics. 
vuntng  at  bit  17  -  8/. 


SMPTE  Journo,  -kew  1892 


41B 


8.  If  ID  is  not  on  list  of  known  IDs.  then 
go  to  11 

9.  If  I  »  I.  read  descriptor  length  D 
(pan  of  descriptor  decoding  algorithm^ 
not  desenbed  here). 

la  IfL  «  0,  ihenD  «  0. 

11.  Co  to  algorithm  specified  by  ID  and 
execute  over  a  block  of  S  bytes,  starting  at 
the  end  of  the  descriptor  k\t  ^  K  ^  D 
bytes. 

’  11  Jump  to  end  of  block  (5  bytes  long) 
and  read  next  header. 

5J  Descriptor  Specification 

SJJ  barodumom 

Pubiidy  readable  descriptors  may  or 
fRay  not  be  moorporated  in  any  dau  block, 
as  indicated  by  one  of  the  bits  in  the  head* 
er  key.  They  would  convey  auxiliary  infer* 
madOT  coaming  the  nature  of  the 
asodated  data,  such  as  authorship,  distri* 
buiofship.  ownership,  intcilcaual  proper* 
ly  restrictions,  sampling  patterns,  filtering 
employed,  color,  nonlinearities.  etc  This 
information  would  generally  be  in  the 
form  of  identification  numbers  assigned  by 
sandards  bodies,  although  opti^  for 
conveying  text,  programs,  or  other  dau 
wiHjld  Like  the  header,  descriptors 
wouidJndicate  their  length  so  that  equip* 
ment  could  skip  past  if  desired,  and  they 
%Mould  have  optional  provisions  for  error 
proienion.  An  efficient  ASN.l  equivalent 
for  the  descriptor  definition  proposed 
below  could  also  be  developed. 

iJJf  Daeriptot  Artkiiecttiff 

The  descriptor  is  divided  into  three 
pans:  a  S-byte  -key."  a  "core"  ranging 
from  0  to  8  ^es.  and  the  *uir  of  length 
defined  by  the  core.  The  key  unlocks  the 
core,  whi^  defines  the  length  of  the  de¬ 
scriptor.  an  indication  of  the  nature  of  the 
contents  of  the  descriptor,  and  the  nature 
of  any  optional  error  protenion  for  the 
core.  The  tail  consists  of  a  series  of  de¬ 
scriptor  identification  numbers,  similar  in 
concept  to  the  standard  identification 
numbers  provided  in  the  header.  For  each 
camera  nonlinear  luminance  map¬ 
ping.  movie  producer,  filtering  algorithm, 
foyalry  payr^t  procedure,  etc.,  there 
could  be  a  separate  descriptor  number  as¬ 
signed  or  rtf^ered  by  appropriate  stan* 
dards  bodies,  indicated  in  a  manner  also 
similar  to  that  of  the  header.  To  properly 
convey  this  list  of  descriptor  identification 
numbers,  the  oil  also  conuins  fields  giving 
the  number  of  such  descriptor  standards, 
the  length  and  type  of  each  such  standard 
number,  and  the  nature  of  arty  optional 
error  protenion  employed.  The  tail  also 
supporu  delKcry  of  fiel^  of  text  m  any  of 
a  large  number  of  languages,  such  as  &g* 
lish  or  Pont*,:  cse.  as  well  as  computer 
languages  su^  as  C  Postscripi,  etc.  Such 
software  elements  would  pemit  the  de¬ 
coding  procedures  to  be  defined  explicitly, 
if  desired. 


A  f»(entiai  area  for  future  improve¬ 
ment  is  development  of  a  more  universal 
subset  of  descnpior  clcmcnis  for  wide¬ 
spread  usage.  It  would  include  parameters 
such  as  rwlution.  raster  definition,  bit 
packing,  etc. 

SJJ  Dtseriptot  Ktj  Difinition 

SJJJ  Descriptor  Key  Archiueture 

Tile  descriptor  kev  eonasts  of  three 
pens:  (1)  a  4.bii  “type'  field  that  eharac- 
terra  the  contents  of  the  descriptor,  (2)  a 
4-bit  “length  type'  field  that  indicates  the 
format  of  the  descriptor  length  field,  and 
(3)  an  S-bit  “protection"  field  for  the  key. 

Descriptor  Key:  4»Bit  ^Type "  T 

Field 

The  purpose  of  the  7  field  is  to  indicate: 
(1)  whether  or  not  this  descriptor  conuins 
a  public  index  (1  bit);  (2)  whether  the  de¬ 
scriptor  length  field  in  the  core  (if  any)  is  2 
or  4  bytes  long  in  integer  format  (1  bit): 
and  (3)  which  of  four  error-protection  op¬ 
tions  are  being  employed  (2  bits).  The  four 
descriptor  error-protection  options  are; 
(I)  no  protenion.  (2)  protected  cere  plus 
unprotened  uiL  (3)  both  core  and  uil 
protened.  and  (4)  both  core  and  uil  dou¬ 
bly  prouned. 

5J.3J  Descriptor  Key:  4»Bii  “Length^ 
Type'  DLT  Field 

This  field  contains  the  length  of  the  de¬ 
scriptor  after  the  core,  in  bytes,  unless  iu 
contents  are  “zero,  zera  zero,  zero'  (for 
descriptor  uiis  longer  than  16  bytes),  in 
which  case  the  length  is  given  by  the  core 
in  a  field  which  is  either  2  or  4  ^es  long, 
as  specified  in  the  7  field  (see  5  JJ.2). 

S.2.3.4  Descriptor  Kev:  S-Bit  'Froteaien' 
F  Field 

The  funnion  of  the  P  field  s  identical  to 
that  of  the  d-bii  error-protenion  tleid  of 
the  header  key:  it  protects  the  2*byTe  de¬ 
scriptor  key  only. 

SJ.4  Descriptor  Core  Defimition 

5J.4.1  Descriptor  Core  Architeaurt 

The  descriptor  core  consists  of  three 
pans:  (Da  field  defining  the  descriptor 
length  (0  to  4  bytes):  (2)  a  field  indteating 
the  nature  of  the  contents  of  the  deserip- 
lor  (0  or  2  bytes):  and  (3)  an  optional  pr> 
tcetion  field  for  the  core  only  (0. 1.  or  2 
bytes).  The  toul  length  of  the  descriptor 
core  thus  runges  bci^n  0  and  8  byus. 

5.3.4.2  Descriptor  Length  Field 

This  field  is  of  length  zero  if  the  descrip¬ 
tor  length  specification  has  been  preempt¬ 
ed  by  the  length  field  in  the  descriptor  key; 
otherwise  it  is  either  2  or  4  bytes  bng  fo 
integer  format  as  determined  by  one  of 
the  biu  in  (he  7  field  of  the  descriptor  key 
(see  5JJ.2).  The  length  of  the  deseripior 


field  as  presented  in  the  core  is  defined  as 
indudtng  ail  the  bytes  in  the  descriptor, 
including  those  in  the  key.  core,  and  uiL 

5.3.4.3  Descriptor  Core  Contenu  Indec 

This  field  contains  0  or  2  bytes,  as  indi¬ 
cated  by  one  of  the  bits  in  the  descriptor 
key  7  field  (see  i  J  J.2).  In  its  2-byte  form 
it  indicates  whether  or  not  the  following 
descriptor  conuins  information  concern¬ 
ing  any  of  16  categories  of  information 
about  the  data  stre^  Among  others, 
these  categories  of  tnformadon  indude 
synchronization  reinforcement,  error- 
protection  dau-  enc^don  k^  packet 
priorities,  authorship,  distributotship, 
(line  or  date  of  any  event,  ownership,  intei- 
lectual  property  restrictions,  sampling  pat- 
urns,  filtering  history,  color,  nonlinear 
mapptnp  employed,  etc  The  last  bit  of 
the  index  is  zero  if  this  descriptor  is  the 
same  as  the  previous  one  assodaud  with 
the  same  header  ID.  Tite  purpose  of  this 
contents  index  is  to  spare  equipment  the 
burden  of  decoding  descriptors  when  their 
eentena  may  be  of  no  interest  This  is  par- 
tkuiariy  so  when  a  long  sequence  of  de¬ 
scriptor  eiemenu  is  repeated  periodieaily 
to  aid  certain  users  having  only  segments 
of  the  dau  stream  available  to  them. 
Users  of  longer  segments  could  therefore 
ignore  such  data  more  readily. 

5.3.4.4  Descriptor  Core  Ferity  Prouaion 

This  field  would  conuin  0. 1.  or  2  bytes, 
as  indicated  by  two  of  the  bits  in  the  de¬ 
scriptor  key  7  field  (see  5322),  These 
bytes  would  protect  the  core  only,  using 
e^es  similar  to  those  employed  in  the 
header. 

SJJ  Descriptor  Toil  Definition 

5.3.5. 1  Descriptor  Tad  Architecture 
The  descriptor  uil  consists  of  four 
fields: 

1.  The  eiement-nufflber  field,  a^ich  in- 
diotes  the  number  of  independent  de¬ 
seripior  identification  numbers  contained 
in  this  descriptor,  tu  length  ranges  from  4 
bits  to  a  manmum  of  2  bytes. 

2.  The  descriptor  element  kngth-iype 
field,  which  specifies  the  lengths  of  each  of 
the  descriptor  identification  numbers  oon- 
taitud  in  the  following  field,  together  with 
their  respective  types:  these  typn  indude 
identification  number  types  similar  to 
those  employed  for  rndkating  sundard 
numbers  in  tte  header,  as  well  as  support¬ 
ing  transmitunee  of  us  and  computer 
programs:  iu  length  is  typically  2  to  4  bits 
per  descriptor  identification  number. 

3.  The  descriptor  identification  num- 
her  field,  which  consists  typically  of  one  or 
more  deseripior  standard  numbers,  each 
in  a  1  to  (hbyie  format  or  appearing  as  a 
sequence  of  us  or  code:  the  total  length 
of  this  field  approsmates  several  bytes  per 
descriptor  element,  or  substantially  more 
if  tes  or  code  is  incorporated. 
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4.  Tite  proteeiion  tieid.  as  detined 
jointly  by  the  protection  option  indicated 
in  2  of  the  T  bits  (see  i  J  in  the  de« 
seriptor  key  combined  with  the  indicated 
descriptor  length:  longer  descriptor 
lengths  would  require  more  bytes  of  pro* 
tecfion  for  any  indicated  level  of  protec* 
tion. 

5.J.S.2  Descriptor  Tail  ElemeniSumber 
Field 

This  fields  consists  of  one.  twa  three,  or 
four  4i^t  words  indieadng  the  number  of 
descriptor  elements  contained  within  this 
descriptor.  Each  4-bit  word  contains  2  bits 
indicating  the  number  of  elements,  and  1 
bit  indicating  whether  or  not  an  additional 
4-bit  word  is  appended,  up  to  a  maxifflum 
'  of  four  words.  Thus  one  4-bit  word  will 
suffice  for  0  to  7  deseri|»or  types,  which 
normally  should  be  sufficient.  Two  concat¬ 
enated  4-bit  words  offer  up  to  2  exp  6  •  64 
possible  elements.  Three  words  can  ac¬ 
commodate  up  to  512  elements,  while  use 
of  all  four  words  (2  bytes  i  can  aecomm> 
date  more  8(XX)  elements,  which  should  be 
sufficient  and  is  the  maximum  numoer  per 
header  allowed  under  this  protocol. 

5.J.5.J  Descriptor  Tail  Element  Length- 
Type  Field 

This  field  consists  of  a  series  of  extensi¬ 
ble  2*bit  words,  one  Kquence  of  such 
words  applying  to  each  descriptor  ele* 
inenL  In  nrast  eases  a  single  2*btt  word 
would  suffice:  the  options  here  are  that  a 
1. 2.  or  5*byte  field  is  reserved  for  the  asso¬ 
ciated  descriptor  identification  number, 
the  fourth  option  is  that  two  2*bit  words 
are  being  employed.  If  the  second  2-bit 
word  is  employed,  the  associated  options 
are  that  4. 5.  or  6  bytes  are  being  employed 
to  indicate  the  associated  descriptor  iden- 
titlcation  number,  this  accommodates  up 
to  10  exp  12  possible  idemiTication  num¬ 
bers.  which  should  be  adequate.  The 
founh  option  available  for  the  second 
2-bit  word  indicates  that  an  additional 
4-bit  word  is  to  be  interpreted.  This  4-bit 
word  offers  16  additional  options,  the  first 
of  which  is  that  following  the  4-bit  word,  a 
1-byie  word  specifies  the  length  of  the 
des^ptor  identification  field. 

The  remaining  15  options  are  of  similar 
form,  but  indicate  that  types  of  descriptor 
data  are  being  employed  other  than  the 
sundard  identification  number  type.  For 
example,  type  2  would  indicate  that  ASGI 
text  was  being  employed  in  a  language  in¬ 
dicated  by  the  lint  character  of  the  text 
stream:  thus  256  possible  languages  can  be 
used.  Types  3  to  16  would  indicate  which 
of  seveni  possible  computer  languages  or 
image  description  formats  were  being  em¬ 
ployed.  such  as  C  Postscript,  etc  If  it  is 
fell  that  the  15  possible  languages  avail¬ 
able  under  the  4-bit  extension  option  in 
the  element  iencth-type  field  is  insd- 
equstc  then  addilional  4.bit  fields  could 
be  appended  by  using  an  extension  bit.  or 


by  using  1  bit  in  each  4.bit  field  to  indicate 
additiorul  4-bii  fields  are  appended  and 
can  be  interpreted  as  were  the  series  of 
descriptor  tail  element-number  4-bit 
words.  (Alternatively,  the  4-bit  word  could 
be  reduced  to  1  or  2.  with  the  understand¬ 
ing  that  the  language  is  sperifled  by  the 
first  following  2  i^es. )  Definition  of  these 
options  is  left  to  the  next  step  in  the 
SMPTE  standards  definition  process. 

SJJ.4  Descriptor  Tail  Standard 
Identification  Numbers 

If  one  or  two  2-bit  words  have  been  pre¬ 
viously  employed  to  indicate  the  length  of 
the  descriptor  idendfication  number,  than 
I  to  256  t^es  may  be  employed  for  the  ID 
number  itself.  The  formats  for  each  of 
these  options  are  indicated  below. 

1  byte  International  standard 

established  by  single 
designated  authority 

2  bytes  16-bit  standard  num¬ 

ber  designated  by  au¬ 
thorized  interrutional 
standards  body  (bod- 
fos) 

3  to  256  1  byte  indicating  the 

bytes  sovereign  state,  and 

the  remainder  (2  to 
255  bytes)  being  avail¬ 
able  for  the  standards 
number 

The  sovereign  sute  and  standards  num¬ 
bers  vould  be  designated  using  proce¬ 
dures  similar  to  that  specified  in  the 
header.  Note  that  the  Imger  standards 
numbers  permit  subdesignations  under 
the  sovereign  sute  indicator  for  subsidiary 
standards  todies,  including  individual  cor¬ 
porations.  institutions,  and  even  individ¬ 
uals.  The  longer  fields  also  permit  use  of 
user-defined  bits  that  can  be  assigned  at 
execution,  thus  providing  a  dau  field. 
Such  data  fields  could  be  used  for  convey¬ 
ing  dynamically  changing  information 
such  as  average  luminance  audio  pin. 
etc 

5J.6  Descriptor  Tail  Error  FroteaUm 

This  field  could  be  eoneentrated  at  the 
end  of  the  descriptor  or  distributed 
throughout  to  simplify  processing.  The 
oumber  of  bytes  and  protocol  employed 
for  this  purpose  would  be  determined 
oniquely  by  the  2  bin  in  the  descriptor  key 
T  field  and  the  descriptor  length,  as  speei> 
6ed  in  the  descriptor  core  descriptor 
length  field  or  in  the  descriptor  key  length 
field  Defirtition  of  these  protection  strate¬ 
gies  might  parallel  those  employed  in  the 
header  and  remain  to  be  defined  more 
folly. 

5.4  Tnnspon  HeaderiDeseripton 

S.4.J  Motivation  and  Ob/ectnies 

Section  5.17  discusses  several  reasons 
why  providing  error  protection,  synchroni- 


zaiion  remforcemenL  packet  priority.  a» 
higher  level  encryption  to  header/descrip 
ton  could  pose  problems  for  interpretivi 
hardware  it'  the  data  is  excessively  noi^ 
Although  the  solutions  suggested  then 
will  accommodate  most  error  environ 
ments.  still  more  serious  situations  can  bi 
handled  using  a  transport  header/deserip 
tor  block  such  as  described  here.  Such  : 
transport  block  has  the  additional  advan 
tage  that  if  insuffldencly  protected  data  t 
moving  into  a  more  hostile  transport  esvi 
ronment.  addilional  protection  cut  be  ih 
corporated  in  the  trattspon  bi^  withou 
having  to  redefine  the  input  blocks.  SW 
lariy,  such  transport  blocks  can  be  m 
mo^  without  penalty  when  moving  iMt 
more  error-free  or  otherwise  benip  etm 
ronments. 

The  objectrve  here  is  to  aggest  bow  tfh 
architecture  of  such  transport  blocks  «i 
consistent  vrith  the  header/descriptor  deS 
nitions  presented  above,  but  not  to  deSot 
all  the  details.  Thus  establshmem  of  dh 
header/descriptor  standard  can  pfooeoc 
without  waiting  for  ail  details  of  the  tnas 
pon  block  to  be  resolved.  It  «muld  be  cm 
fol  to  resolve  such  deuils.  hmvever,hcfisn 
users  of  the  sundard  adopt  mferior  meih 
ods  of  addressing  the  same  transport  prab 
lems. 

The  principal  motivation  for 
trattspon  blocks  is  to  avoid  proliferatiaBO 
standard-specific  alterrutives  for  addram 
ing  such  transpon  problems.  Such  prafif 
eration  could  increase  the  cost  ofdecodini 
equipment  that  would  have  to 
date  all  these  possibilities.  In  a  fogh-eira 
environment,  executing  the  muhipk 
search  strategies  neeesuty  when  ytchro 
nization  is  lost  or  heavy  errors  exist,  cook 
become  prohibitive.  For  this  reason  it  ■ 
imponant  to  have  only  a  few  standard 
tions  for  ceruin  aspects  of  the  nanspon 
block.  Defining  an  effieem  small  set  ii 
beyond  the  scope  of  the  present  effort  and 
would  take  considerable  study.  Thercfom 
it  is  reasonable  to  assume  that  this  stadjy 
would  be  completed  after  any  initial  head- 
er/descripior  sundard  is  spewied.  One  B> 
histrative  candidate  for  su^  a  set  appean 
here  in  A^ndix  A:  it  is  huended  only  n 
initiate  dtscussion  of  these  issues. 

5.4,2  ArehiteeaoeafThnspartBteeh 

Transpon  header/descripior  bioeks 
would  consist  of  a  single  header/deKij^ 
for.  where  the  header  would  yecify  as 
iatemational  standard  ID  indicatiag 
which  type  of  transpon  bioek  was  in- 
wived:  only  a  few  such  types  vould  ever 
be  defined.  The  descriptor  of  the  tnaa- 
pon  block  would  convey  up  »  e^  differ¬ 
ent  elements: 

I.  Descriptor  ubie  of  contents  (sum 
dard  format  oefined  earlier). 

X  Synchronization  rctnfereemem  bin. 
not  error  protected. 

3.  Error-protection  bits  for  the  va«- 
pon  header  and  its  attached  foOewiag 
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beader/ducriptor. 

4.  Enervptwn  key  for  deciphering  the 
descriptor,  if  any.  in  the  following  block. 

5.  Block  priority;  determined  by  dau 
originator,  mdleating  relative  priority  of 
the  following  block  concerning  interpreta* 
tion  or  transmission  in  cases  where  mad* 
equate  capacity  is  available.  Authorization 
keys  may  also  be  needed  to  verily  priority 
in  eertain  eases.  Price  bidding  oould  be 
supponed  here  too. 

d.  Authorizadbns  and  fee  taechanisms 
fbr  alteration  or  use  of  data. 

7.  Block  sequence  numbering  and  tint* 
in|-reeonstruetion  information. 

8.  Padding  to  yield  one  of  the  very  few 
aUowed  lengths  for  the  trattsport  block 
corresponding  to  the  mtematio^  header 
sandard  ID. 

In  addition  to  these  eiesnena  there 
would  also  be  the  traditional  field  in  the 
descriptor  defining  its  length,  although  in* 
terpreution  of  this  length  would  be  un> 
necessary  because  the  transport  lock 
length  is  specified  by  the  header,  and 
there  is  no  data  payio^  following  the  de* 
scriptor. 

Tlie  six  main  descriptor  elements  would 
convey  infotmation  using  traditionai  de> 
scriptor  sandard  numbers,  where  long 
numbers  accommodating  an  adequate 
number  of  user  biu  could  be  employed. 
Defining  these  descriptor  standard  num> 
bets  is  the  task  that  can  and  probably  must 
be  postponed  until  the  technical  tradeofi 
associated  with  different  choices  are  bet* 
ter  understood.  Thus  standards  for  head* 
ers  and  descriptors  can  and  should  be 
adopted  in  advance  of  these  descriptor 
definitions  for  transport  blocks. 

5.4J  Dteoding  bam  for  Trmiuport 
Mmdm 

Decoding  such  a  transport  header  in  a 
hifh*crrTor  environment  would  be  relative* 
ly  straightforward.  First,  if  synchronization 
had  been  km.  synchronization  would  be 
established.  Initially  this  might  be  done 
assuming  a  lew<rror  environment.  If  the 
environment  a  noisy,  then  each  of  a  few 
possible  synchronization  blocks  would  be 
sought,  including  periodic  repemioo  of  le* 
gal  header  keys,  assuming  that  key  bursts 
might  have  been  employed  tor  this  pur* 
pose.  Because  all  possible  syiMriironization 
words  might  be  sought,  it  is  important  to 
have  only  a  few  legal  ones  if  they  are  many 
byta  long.  Because  the  lyiK  reinforee* 
ment  bits  could  be  only  in  a  small  number 
of  positions  relative  to  the  beginning  of  the 
transpen  block,  each  such  positioa  eould 
be  tested  tor  eonsisiency  with  the  assodat* 
sd  erTor*ptoteetion  bits,  which  are  also  lo* 
eated  in  a  small  number  of  possible 
iocations.  Confirmation  of  synchroniza* 
tion  follows  if  legal  headers  come  nmedi* 
ateiy  after  the  indicated  end  of  any  block. 
Once  the  uanspon  block  is  synchronized 
and  error  eorreoed.  the  renuining  dc* 
seripior  fields  eonuining  any  enciyption 


key  for  the  descriptor  in  the  tolluwing 
block,  or  any  packet  pnoriiy  mformation 
can  be  dedphered. 

One  prindpal  new  constraint  should  be 
imposed  by  the  standard  on  manufactur* 
ers  of  equipment  handling  this  header/de* 
scriptor  sandard.  If  transpon  blocks  are 
to  be  useful  such  equipment  must  never 
insert  dau  between  a  transport  block  and 
the  following  block  to  which  it  applies. 
Transport  systems  should  try  not  to  scram* 
ble  the  sequence  of  daa  blocks  in  any 
event,  but  if  a  transport  block  should  aed* 
dentally  be  prepended  to  the  wrong  daa 
block,  the  ^cket  priority,  encryption  key 
for  the  dau  block  descriptor,  and  the  error 
protection  for  the  beader/descriptor  could 
be  ituppropriate.  resulting  in  a  scrambling 
of  the  interpreted  header/descriptor.  Such 
scrambling  would  also  typically  cause  loeal 
bss  of  synchronization,  particulariy  in 
bigh*enor  environments.  Since  tran^n 
blocks  would  normally  be  quite  brief  com* 
pared  to  typical  dau  blocks,  such  a  eon* 
straini  should  not  be  difficult  to  atisfy. 
Such  transpon  blocks  could  also  be  added 
or  subtraned  at  will  by  a  given  transpon 
layer,  however,  provided  they  are  appro* 
priate  to  the  blocks  which  they  precede. 

If  a  data  stream  is  entered  in  the  middle 
of  a  transpon  block,  then  confusion  might 
result.  To  protect  against  this  unlikely  po^ 
sibility.  equipment  might  choose  to  wait 
until  the  second  valid  header  is  intercept* 
ed  before  commencing  decoding. 

6.  Abstract  Syntax  Notation  1 
(ASN.l)  Header/Descriptor 
Architecture 

fi.1  Background 

Abstraa  Syntax  Noution  1  (ASN.l)  is 
an  existing  ISO/CGTT  Standard  in  com* 
mon  use  within  the  computer  and  ule* 
eommunications  industries.  Within  the 
ASN.l  ftamewerk.  it  is  straightforward  to 
define  an  SMPTT  header/descriptor  that 
meets  the  objenives  described  in  Sections 
1  to  5  above,  and  it  would  leverage  existing 
tools,  expenise.  and  administritive  souc* 
tures. 

ASN.l  is  derived  horn  earlier  work  at 
Xerox  PARC  on  Courier  (law  1970s).  An 
early  version  of  the  notation  (fc  1984)  was 
used  in  the  first  draft  of  the  CCITT  X400 
series  of  rceonunendations  on  mesuge 
handling  systems  (ie.,  electronic  mail). 
ISO  and  (TCITT  then  jointly  developed 
ASN.l  far  use  within  the  OSl  presentstion 
layer  (c  1988). 

ASN.l  is  nmv  widely  used  in  a  range  of 
standards  aaivities,  nduding  the  CCITT 
XiOO  directory  service  and  both  the  OSI 
and  Internet  network  management  sys¬ 
tems.  Over  the  years,  a  eoUeoion  of  soft* 
ware  toob  and  utilities  to  support  ASN.l 
has  been  (and  is  being)  developed. 

(J  Concepts 

ASN.1  is  an  extensible  noution  for  de¬ 


scribing  dau  that  is  to  be  exchanged  by 
transmission  or  storage.  It  is  much  tike  a 
programming  language,  such  as  C  and 
Pascal.  There  are  several  simple  types, 
such  as  integer,  real,  and  octet  string  (le> 
byte  string),  and  cons  w«or  typ«  that  can 
be  used  to  build  arbitraniy  complex  dau 
structures,  including  hierarchical  repre* 
senutiotu  (e.g.,  packet  within  packet). 

An  ASN.l  header  can  be  thought  of  as 
an  envelope  that  conuins.  for  example,  a 
single  video  ftame.  ASN.l  supports  the 
notion  of  embedding,  which  all^  one  or 
more  dau  structures  to  be  contained  with¬ 
in  another.  Thus,  a  sequence  of  ftames  can 
be  embedded  within  an  outer  header  (or 
CTivelope)  that  labels  a  program  segment. 
This  can  be  taken  to  coarser  granularity, 
c.f.,  shots,  scenes,  programs,  etc.  Similar¬ 
ly,  it  can  be  oken  to  finer  panularity  to 
embed  audio  tracks,  dosed  captioning,  de* 
seriptofv  etc.,  within  individual  frames. 

A  key  feature  in  ASN.l  is  the  separation 
of  how  the  dau  is  described  (Abstract 
Syntax)  and  how  dau  is  encoded  (Basic 
Encoding  Rules,  or  BER).  Dau  structures 
are  described  in  a  hurrun-readable  syntax 
and  automatically  translated  into  the  bits 
and  bytes  for  transfer.  When  a  new  dau 
structure  (or  type)  is  defined,  ia  represen* 
ution  is  automatically  generated.  Punher* 
more,  deployed  ASN.l  compliant  systems 
will  be  able  to  mterpret  new  structures 
without  hardware  modifieatioa 

The  fbUowing  sumtruty  description  of 
ASN.l  presenu  only  enough  deuil  to  mo¬ 
tivate  tu  use  for  the  spedfie  needs  of  a 
beader/descriptor.  For  formal  definition 
of  ASN.l.  refer  to  ISO  8824^825  and/or 
CCITT  XJ08/209.  A  more  accessible  de¬ 
scription  can  be  found  in;  Marshall  T. 
Rose.  The  Open  Book:  A  Pmaieei  Penpee- 
tne  on  OSL  Prentice-Hall  1990. 

Basic  Eacoding  Rules  (BER) 

All  ASN.l  types,  whether  a  simpie  type 
or  a  structured  type,  can  be  encoded  using 
the  same  basic  fontut  of  three  fields: 
(tag]  (length)  (value) 

The  three  fields  together  make  up  a 
dau  hem.  Each  field  is  variable  in  size  to 
accommodate  arbitrarily  complex  sub* 
structures  and  encoding^  A  simpie  type, 
such  as  an  inw|er,  requires  only  a  few 
bytes.  A  structured  type,  such  as  a  long 
^se  stimg,  can  be  M^es.  Gbytes,  or  larg¬ 
er  as  necessary  to  eomain  the  payload  dau 
i«lue.  The  ba^  format  is  inhemtiy  seif* 
identifying  and  extensible. 

A  dau  stream  is  a  sequence  of  hems, 
each  of  which  can  be  struenired  or  nested. 
Thereby,  one  can  define  arbitrarily  struc¬ 
tured  dau  for  both  header  and  destnipior, 
ineiuding  nested  paeket*wittain*paeket 
structufts. 

"Tag  FieU*.  The  tag  field  specifies  the 
type  of  the  hem  value.  Several  simple  and 
structured  types  (integers,  character 
arings,  etc)  are  univetsaily  defined  in  the 
ASN.l  standard,  and  are  recognized  in  all 
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eomplianc  environments.  Also,  one  can 

•  define  types  that  exist  within  the  specjfic 
environment  of  an  application  or  eommu* 
nieations  context.  A  tag  is  ptincipaJIy  en> 
coded  as  a  single  b^e.  but  can  be 
extended.  In  the  BER.  the  tag  is  encoded 
as: 

<2>  <1>  <S>  -bitvlield 

(class  I  p  I  tag  number] 

The  ug  Held  allows  a  receiver  to 

*  "parse"  the  incoming  dau  stream,  select* 
ing  those  eomponents/rypes  in  which  it  is 
imerested  and  bypassing  others. 

The  EXTER.VAL  type  is  of  partieular 
significance.  In  essence,  it  is  a  univeisal 
hnder  for  the  dau  that  it  encapsulates. 
The  EXTERNAL  type  is  described  fur* 
ther  below.  Other  types  are  relevant  to  use 
in  a  descriptor. 

*Lmph  Field*.  The  length  field  indi* 
cates  the  sue  of  the  value.  It  is  an  integer 
of  one  or  more  bytes  that  specifies  the 
number  of  bytes  in  the  value  field.  In  the 
BER.  the  short  form  of  length  is  encoded 
as  a  single  byte,  and  can  indicate  lengths  of 
0.126  bytes  of  value.  In  extended  form, 
the  first  byte  specifies  the  number  of  bytes 
of  length.  Length  is  encoded  as; 

short  form  :  1  byte  :  (Obbbbbbb]  : 
lengths  of  0..1126  bytes 

extended  form :  n  bytes :  (Iruutnnnnj 
(bbbbbbbbj  (bbbbbbbbj  ..Ji 

Only  the  number  of  bytes  needed  to 
speafy  length  are  used.  Thus,  the  length 
field  is  compact.  The  extensibility  of  the 
length  field  permits  a  maximum  length 
field  that  is  126  bvtes  to  specify  a  length  of 
-2*1008  or  -10*303. 

Note  that  a  common  length  ipecifica* 
tion  is  used  regardless  of  the  assodaud 
data  item.  Accordingly,  there  is  no  need  to 
invent  custom  length  encoding  schemes 
for  each  new  data  item. 

*yaiue  Field*.  The  value  field  is  the  val* 
ue  in  the  type  specified  by  the  tag.  It  is  a 
string  of  b^es  the  number  of  which  are 
specified  by  the  length  field.  It  is  encoded 
as  deftned'in  the  BER  for  that  type. 

M  ASN.1  EXTERNAL  Type 
(Uoivenai  Header) 

An  ASN.l  EXTERNAL  type  is  a  uni* 
uersal  header.  All  ASN.1  eoti^diam  proto> 
col  interpreters  can  extract  and  interpret 
an  EXTERNAL  without  ambiguity.  The 
definition  of  EXTERNAL  is  quite  fiexi* 
We.  but  that  flexibiii^  is  not  needed  here 
to  meet  the  bask  ebjieewes  of  a  uruversal 
header. 

A  simplified  EXTERNAL  type  is  en* 
coded  as: 

(»*l  ■  class  ■  O.p  ■  Lnumber  •  8] 
(length)  (object  id]  (payload] 

*Tag*  and  *length*  fields  are  encoded 
as  described  above.  *Obj«a  ID*  provides 
unambiguous  selMdenufication  for  the 
header.  *Payload*  is  a  sequence  of  bytes 
that  are  interpreted  according  to  the  sun* 
dard  indicated  by  the  object  ID. 


*Objea  ID*  is  itself  an  ASN.l  type  with 
Ug.  length,  and  value  fields,  and  is  en* 
coded  as: 

(ugw  0.0.6]  (length)  (id  value] 

*Objea  ID*  value  is  a  sequence  of  bytes 
that  represent  the  hierarchical  identifier 
for  the  referenced  standard.  ID  values  are 
assigned,  registered  and  administered  by 
CU  l  'l  and  ISO  in  the  course  of  sundards 
development  Or.  ID  assignment  can  be 
delegated  to  member  bodies  or  companies 
or  organizations  (thereby.  SMFTE  could 
assume  responsibility  for  administering  a 
portion  of  the  ID  speee.)  The  root  prefbt 
values  are: 

ccnT(o]  _ 

fccommendation  :  CCITT  conunit* 
tees 

question(l]  :  CCITT  Study 

Groups 

admiatstration(2]  :  countiy  Prs 

(country  cede) 

network  operator :  X121  erguiza* 
dons 

ISO(l) 

sundardjO]  :  ISO  standards 

registration  :  ISO  authorities 

authority)  1] 

member  :  member  bodies 

body[2|  (country  code) 

identified  :  organizations 

organization(3] 

joint  ISO  :  assignment 

CCm[2]  :  delegated  to 

ANSI 

A  few  prefixes  are  of  particular  note. 
iBo.standard  registets  ail  ISO  standards. 
cdtLadministration  and  iso.memberbody 
are  assigned  to  sovereign  bodies  (identA 
fled  by  their  international  telephone  coun* 
try  code),  isoxitganization  is  assigned  to 
international  orgwizations.  These  cover 
vinually  all  the  situations  under  which  a 
header  identifier  wilt  need  to  be  assigned. 

An  ID  value  is  encoded  as  a  sequence  of 
bytes.  The  fint  two  levels  are  encoded  in 
the  first  byte  •  e,b.  •  40*a  *b,  a  <  ■  3, 
h  <  «  39.  Remaining  levels  are  encoded 
as  one  or  more  bytes  as  needed  to  repre* 
sent  the  numerical  value  for  that  level.  If  a 
value  is  greater  than  127.  it  requires  more 
than  one  byte:  the  MSB  of  the  byte  is  set  to 
indicate  that  the  value  is  continued  in  the 
next  byte,  and  so  on.  For  exan^le,  'ao. 
sandard.jpeg  is  3  bytes: 

ao.standard.jpeg  s  li).109l8* 

•  (40)  (128i>8S]  (38) 

Smilariy.  Group  3  Fax  is  identified  ax; 
ecitLrecomfflendation.t.4  :  0X120.4 
-  (OJ  (20)  (4) 

The  extensibility  of  the  ID  value  field 
permiu  up  to  126  byte  long  IDs  to  specify 
distinct  IDs  numbering  to  -3*8K  or 
-10*265. 

*.^  iim  wntMf.  JfEC  »  Maniif  ■«  ««i  m  ISO 
•usMre.  TInm.  iSoitth  meiifiii  le  ke  coma.  Uic 
MMiecr  Set*  ilAlOVtsi «  mi  edicwl. 


*Payload*  is  an  ASN.l  type  with  *fi 
length,  and  value  fields,  as  a  se* 

quence  of  bytes  that  are  interpreted  ae* 
eording  to  the  standard  indicated  by  the 
*Objea  ID*. 

(ug«  2.0.1]  (length]  (payload  value] 

ASN.l  Descriptor 

In  addition  to  the  EXTERNAL  type 
(for  use  as  header),  ASN.l  defina  other 
basic  types  to  represent  a  variety  of  values, 
induding:  booieans.  inttgeis,  reals,  byte 
strings,  character  string  univeisal  time 
code.  etc.  and  eonstruciioos  of  value  hue 
arbitrary  dau  structures. 

Although  a  disdnciion  is  drawn  be* 
tween  header  and  descriptor  in  this  fc* 
poa  the  ASN.l  approach  permitsadn^ 
mechanism  to  serve  both  functions.  The 
full  beneSu  of  ASN.1  become  ipparent 
when  it  is  allied  to  the  descriptor.  Eipe* 
dally  the  ability  to  eontrua  new  9pes  m 
to  incorporate  reference  to  other  aan* 
dards.  (See  current  ISO  work  on  Image 
Interchange  Format  [IIF]  for  an  oenple 
of  ASN.l  use  in  defining  dessipiofs.) 

6J  How  ASN.1  Addrexse 
Objectives 

The  following  describe  how  an  ASN.1 
beader/descriptor  addresse  the  objecowe 
suted  at  the  beginning  of  this  report 
Univeisality  -  ASN.1  beadeedeseri^ 
lor  promote  and  enhance  untversaiity: 

•  It  eomplie  with  and  recognize  CBB* 
ing  standards  and  practicet  All  taBing 
and  future  ISO/CCITT  standards  ec 
uniquely  identified  by  an  ASN.1  Objea 
ID. 

•  It  addresse  the  issue  ot  sovereignty. 
ISO/CCnr  adminsten  and  delegate  a^ 
signment  of  Object  IDs  to  subeommittect 
member  bodies  (by  countiy  code),  and  or* 
ganizations.  The  complexity  of  this  usk 
and  the  benefits  of  leveraging  existing  ad* 
ministrauve  structure  should  not  be  ei* 
deresttmated. 

•  It  fadliute  coordination  among  tele* 
vision,  telecommunications,  and  computer 
industhes. 

•  It  seu  a  minimal  level  of  compHince 
for  iow<ost  receivers.  Funhennote.  the 
advanced  suge  of  definition,  tools,  and  ca* 
pertise  will  fodiiute  the  t^d  deploymem 
of  header<ompliant  devices. 

Longevity  •  aSN.1  be  inhetem  loo* 
gevity: 

•  All  fields  of  ASN.1  type  (ag,  kngth. 
value)  cu)  be  exunded.  Peyioad  lengths 
from  a  few  byte  to  - 10*303  can  be  repre* 
ented.  Similarly.  01^  IDs  can  range 
from  a  few  byte  to  - 10*265  bytes. 

•  It  he  a  preexisting  registry  ^  kadf* 
oiaintaining.  ISO/CCTTT  airoady  icfii* 
lers.  administers,  and  delegates 
assignment  of  Object  IDs. 

•  It  defines  immutable  identifiers. 
Once  an  Objeo  ID  is  aaigned.  it  csiBS 
"for  all  lime."  In  the  foture.  when  an  eW 
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ASN.1  header  is  recopiized.  there  is  no 
ambiguity  to  the  referenced  sundard. 

EttensibUity  —  ASN.]  is  inherentiy  ex* 
tensible: 

•  All  fields  of  aSN.I  types  can  be  ex¬ 
tended  without  redefinition. 

•  New  types  can  be  defined  and  their 
encoding  automaocaJly  generated  without 
the  need  to  introduce  new  rules. 

•  Since  ASN.I  is  fully  defined,  any 
compliant  receiver  and  equipment  are 
fuaranteed  to  be  able  to  recognize  fitture 
extended  ASN.I  headers. 

Interoperability  •>  ASN.1  it  inherenily 
imerope^ie: 

•  It  has  a  weO-fonned  public  definition. 

•  It  is  already  in  use  in  several  impor¬ 
tant  applications  and  industries. 

•  It  complies  with  etdsting  and  develop¬ 
ing  standards  (including  image  standards). 

•  It  allows  standards  and  struentres  to 
cress-reference  each  other.  The  video 
data  stream  can  contain  structures  defined 
by  other  standards,  and  vice  versa. 

*•  It  permits  the  same  information  to  be 
interpreted  in  different  ways  within  differ¬ 
ent  domains  without  prejudice.  To  the 
video  industry,  the  inlormaiion  is  a 
continuous  video  stream.  To  the  telecom¬ 


munications  industry,  the  same  informa¬ 
tion  is  a  sequence  of  bits  to  be  transmitted. 
To  the  computer  industry,  that  sequence 
of  bits  is  interpreted  as  data  structures. 

•  It  permits  independent  formal  defini¬ 
tion  of  imellectuai  property  protection, 
encryption,  and  other  source  coding  de¬ 
scriptors  by  appropriately  sanaioned  ex¬ 
pert  groups  (either  ad  hM  or  extant). 

•  Experts  groups  and  standards  bodies 
can  autonomously  develop  and  evoNe 
specifications  within  the  domains  of  their 
expertise.  The  ASN.1  cress-referencing 
capability  achieves  a  degree  of  interopera¬ 
tion  and  coordination  among  parallel  acti¬ 
vities  —  coding  deuils  are  automadcally 
resolved,  avoiding  redundant  and/or  con¬ 
flicting  efforts. 

Cosuperformance  effectiveness: 

•  It  is  straightforward  to  recognize  and 
decode  with  a  single  onifoim  procedure. 

•  Uniform  dec^  hardware  and  soft¬ 
ware  can  be  shared  among  industries  and 
applications  yielding  economies  of  scale. 

•  Existing  tools  and  expentse  can  re¬ 
duce  the  time  and  cost  of  deploymenL 

Compactness: 

•  ASN.I  is  compact,  but  not  so  much  as 
to  complicate  decoding  or  to  compromise 


other  oojeetives.  A  typical  ASN.I  header 
would  be  - 15  bytes  for  a  1-Mbytc  payload 
(Le-  -.0014^  overhead). 

•  A  7-byte  ASN.I  header  can  be  real¬ 
ized  using  the  standard’s  indirect  refer¬ 
ence  option.  (In  fact,  a  2-byte  context 
specific  header  may  be  realizable.)  Given 
typical  payload  sizes,  however,  it  is  unlike¬ 
ly  that  this  level  of  eompanness  will  be 
required  —  small  payloads  are  either  ag¬ 
gregated  or  infrequently  used,  especially 
in  the  video  domain. 

Rapid  Capture: 

•  An  ASN.I  header  is  signified  by  io 
first  byte  tag  field  (equal  to  8)  followed  by 
length,  ebjea  ID.  and  payload.  Thus,  an 
ASN.I  header  is  straightforward  to  recog¬ 
nize  in  the  data  stream. 

•  >^N.l.  like  any  lengtlifidentifler 
header,  provides  eariy  identification  of  the 
payload. 

Ediubility: 

•  ASN.l's  structuring  apabilities  per¬ 
mit  arbitrary  editing,  sequencing,  struaur* 
ing,  and  embedding  of  payload  streams. 
All  of  this  is  accomplished  within  a  single 
uniform  mechanism,  and  without  requir¬ 
ing  unnecessary  decoding  of  the  payload 
itrelf. 


Appendix  A  —  Transport  Header 
A.0  Introduction 

The  header^escriptor  design  provides  for  binding  of  a  trans- 
pon  header  to  the  main  header.  This  is  accomplished  only  by 
mandating  that  the  traasport  header  not  be  separated  from  the 
main  header  and  its  associated  data,  which  it  is  transporting. 
"The  funakin  of  the  main  header  is  to  identify  the  data  that 
follows.  The  function  of  the  transport  header  is  to  help  the  main 
header  and  possibly  its  payload  in  its  journey  from  origination  to 
destination. 

The  following  transport  header  design  is  “work  in  progress.' 
and  thereiore  is  meant  to  be  an  example  rather  than  the  final 
structure.  It  supplements  the  discussion  in  Section  5.4.  The  de¬ 
sign  issues  and  intcraaions  are  a  bit  complex,  so  the  principles  of 
the  transpon  header  design  are  best  illustreted  by  way  of  a  m- 
reeily  designed  example.  If  adjustmenu  are  made  to  this  design, 
then  care  must  be  taken  concerning  effects  of  such  adjustments  to 
the  rest  of  the  design  and  the  functioning  of  the  transpon  header. 
In  particular,  there  is  a  rigid  requirement  that  certain  fields  be 
prespeeified  as  to  length,  or  as  to  lengih-spedficaoon  (type  fields) 
m  their  meaning. 

It  is  necessary  that  the  transport  header  be  totally  indepeni^t 
of  any  standard  described  by  the  main  header.  TTus  is  reqiured 
beeaiuc  it  may  be  necessary  to  change  transpon  charactemties 
ktr  all  main  headers  on  a  given  data  stream,  irrespective  of  the 
standards  or  formats  of  thitae  main  headers.  _  _ 

An  example  might  be  the  need  to  provide  impiwed  era 
protection  when  mming  a  data  stream  from  a  bigh-reiiabtlity 
fiber  to  a  high-eTTor-rate  radio  frequency  transmission. 

A.O.J  ASNJ  Trwpon  Hmdtr  Ym»b«  Daipttd 

The  transpon  header  design  example  illustrated  here  works 
mgethcr  wuh  the  header/desenptor  design,  as  described  in  Scc- 
tnifts  I  through  5  of  this  SMPTE  report.  . 

.No  transport  header  has  yet  been  designed  for  the  ASN.I 
wniax.  Mlusirated  in  Section  6.  Thus,  the  ASN.I  syntax  ia.  at  pres- 
..u.  only  suiubie  for  error-free  channels  that  preserve  the  dau 


and  its  order  without  contention  from  source  to  destination.  In 
order  for  the  ASN.  1  syntax  to  meet  its  ojeetives  of  interoperability 
with  imperfect  or  eemgested  channels  or  media,  a  mechanism 
similar  to  the  transpon  header  example  shown  here  will  be  re¬ 
quired.  it  is  hoped  that  a  transpon  header  design  might  be  devel¬ 
oped  for  the  ASN.I  ^tax  system,  possibly  modeled  on  the 
transpon  header  illustrated  here,  together  with  the  main  header/ 
descriptor  design.  TTic  enormous  flexibility  of  ASN.  1  syntax  must 
be  tempered  to  provide  a  limited  number  of  options  for  transport 
headeis.  each  with  appropriate  protection/cofTeetion  mecha¬ 
nisms.  It  is  hoped  that  registration  rules  and  flexibility  in  ASN.I 
can  also  be  used  to  provide  a  suitable  format  transpon  header 
design  that  is  property  restrined.  Byte  alignment  a  also  pan  of 
the  structure  of  length  fields.  If  bit  alignment  is  needed  for 
ASN.I.  then  funher  suiubie  adjustments  will  be  required. 

Another  method  to  provide  transpon  assisunce  is  to  convert 
from  the  main  header/descriptor  design  of  Sections  1  through  5. 
to  and  from  the  ASN.I  noution.  Since  a  transpon  header  design 
is  available  for  the  main  header/deschptor.  conversion  to  this 
header  would  provide  access  to  a  transpon  header.  When  reen¬ 
tering  error  and  contention-free  environments,  the  header/de- 
tcriptor  could  be  reconvened  to  ASN.I  syntax. 

If  the  ASN.I  method  becomes  popular,  then  it  is  hoped  that  a 
stitable  ASN.1  transpon  meehatiism  might  be  developed. 

A.t  Design  Otiftetiwes  fit  the  THntpm  Header 

•  The  transpon  header  must  be  standard-ID  independent,  so 
that  it  an  apply  to  all  header  standards  equally  and  uniformly 
across  the  entire  dau  stream. 

•  The  transpon  header  should  be  removable  without  damage 
10  the  function  of  the  main  header  that  it  is  helping  to  transport. 

•  A  transpon  header  must  be  able  to  be  added  to  any  header 
format  or  de^ptor  fonnat  without  changing  any  of  the  irwaning. 

•  The  transpon  header  formats  should  suppon  *in  ^  etear' 

proteaiofl  of  the  main  header  and  payload,  where  the  bits  are  not 
altered,  so  that  the  transpon  header'ean  be  added  and  removed 
without  any  adjustment  of  bits  within  the  main  header,  its  de¬ 
scriptor.  and  itt  payload.  _ 

•  In  addition  to  suppon  for  ~in  the  dear'  preieaioa  more 


SMPTE  Journal.  Jund  1992 


430 


'  efTident  protection  should  be  supponed  (such  as  Reed>Solo* 
monl.  where  the  header,  and  pwibiy  iu  payload,  are  encoded. 

•  The  transport  header  architecture  should  suppon  one  or 
.  more  mechanisms  for  correcting  bum  errors. 

•  Optional  support  should  be  provided  for  encryption  of  the 
main  header's  descriptor,  as  well  as  the  dau  payload. 

•  The  transport  header  should  suppon  authorization  and  use 
information,  in  helping  the  transpon  system  determine  which 
destinations  are  appropriate  for  funher  or  final  distribution. 

•  The  transpon  header  arehiteoure  should  suppon  backward 
play  through  the  dau  stream. 

•  The  trarispon  header  should  suppon  optional  rapid  header 
synchromzation  capture. 

•  The  tran^n  header  should  suppon  communications  net* 
works  by  providing  information  concerning  the  dau's  priortty  and 
value. 

•  The  transpon  header  should  suppon  timing  reconstruoion 
when  utilizing  networks  that  diston  timing  or  ordering  of  data. 

•  The  transpon  header  should  suppon  dau  ordering  require* 
ments.  when  utilizing  networks  which  might  reorder  the  data. 

•  The  transpon  header  architeciure  should  strike  a  balance 
between  the  opposing  forces  of  flexibiiity  and  ease  of  use.  Ihus.  a 
small  number  of  options  should  be  earefolly  chosen  for  maximum 
flexibility,  with  the  small  number  of  options  allowing  a  simple 
interpretation.  It  is  the  faa  of  having  a  small  number  of  options 
that  allows  easy  interpreution. 

•  For  transpon  functions  that  encode  the  header  anoUor  its 
(lata.  4  simple  ~in  the  dear"  length  field  should  allow  devices  that 
cannot  decode  such  data  to  skip  to  the  next  transpon  header. 

•  For  devices  that  cannot  process  the  transpon  header,  a  sim* 
pic  "in  the  clear**  length  field  should  allow  such  devices  to  skip 
directly  to  the  header.  Funher.  such  devices  should  be  able  to 
easily  interpret  the  transpon  header  format  so  they  can  quickly 
determine  whether  the  main  header  is  ~in  the  clear.**  and  there¬ 
fore  direnly  readable. 

Tlie  following  example  design  meets  these  objeetives. 

AJ  Three  lypes  of  TVansport  Header  Allocated  in  the 
Header  Key 

In  order  to  meet  these  objectives,  there  are  three  types  of 
transpon  headers.  The  first  is  the  basic  transpon  header,  which 
provides  the  majority  of  transpon  capabilities.  The  second  is  the 
redundancy  transpon  header,  which  »  used  to  proten  against 
burst  errors.  The  third  is  the  reverse  transpon  header,  which  is 
used  for  reverse  play. 

Three  of  the  2-b>ie  header  key's  2S6  possible  codes  are  re¬ 
quired  for  the  transpon  headers.  In  the  current  ^-byie  header  key 
design.  1  byte  is  dedicated  to  error  protection.  Tlie  other  byte  is 
split  into  two  a-bit  flelds. 

The  first  4>bit  Held  is  the  ‘length  type**  field.  Codes  15  and  16 
(numbers  14  and  15)  are  unallocated.  ‘These  two  codes  enable  the 
second  a-bit  ID  field  to  be  used  for  special  purposes  such  as 
designating  the  three  transpon  header  types.  Thiny*iwo  such 
eodn  are  available,  leaving  29  codes  unallocated  if  3  codes  are 
assigned  to  the  three  transpon  header  types.  The  basic  tnmspon 
header  provides  most  of  the  capabilities  required. 

AJ  Functions  of  the  IVinsport  Header 

The  funetions  of  the  transpon  header  are  at  follows: 

1.  SytK  reinforcement  for  those  dau  transpon  media  where  it 
is  desirable  to  rapidly  or  simply  sync  to  the  headers  or  header- 
data  combinations  on  switching  between  streams. 

2.  Improved  error  protection  via  extra  protection  bits  for  both 
the  tnnspon  header  and  the  main  header  and  iu  descriptor. 

3.  Conveyance  of  priority  for  the  main  header  and  iu  data,  for 
those  cases  where  a  channel  may  be  operating  at  capacity  and 
thus  where  channel  eontrollets  must  decide  whicn  headers  and 
their  payloads  it  must  drop.  Authorization  keys  may  also  be  need¬ 
ed  in  otdcT  to  verify  priority.  Network  aeeessing  methods,  such  as 
pricing-bidding  le^iques.  would  also  be  supponed  here. 


4.  Encryption  and  secuniy  intomutiun  for  the  majn  header's 
descriptor,  possibly  combined  with  the  descriptor's  own  eaeiyp 
tion  and  secunty  information,  in  order  to  protect  the  dau  stream 
following  the  main  header.  The  protection  optionally  provided  by 
the  main  header's  deschptor  may  need  to  be  augmented  whea 
the  data  stream  is  sent  through  public  or  vulnerable 
channels. 

5.  Authorization  information.  Such  information  would  iadi* 
cate  who  could  receive,  who  could  edit  and  reassemble  with  other 
material,  etc  Alsa  copyright  and  royalty-fee  information  would 
be  enabled  here.  Perhap  auuwnated  mechanisms  of  fee  tonaage 
would  be  supported  through  this  field. 

6.  Sequence  numbers  may  be  added  where  networks  arc  used 
for  transport,  which  may  reo^r  packets.  In  addition  to  ttgifiww 
numbers,  the  combination  of  the  transport  header  may  rsquitt 
infonnation  from  the  main  header's  descriptor  in  order  far  the 
network  to  be  able  to  guarantee  deiiveiy  within  kn^  imumrimm 
and  time  windows.  For  out^-order  delive^,  some  networks  can 
control  the  amount  of  time  between  a  giren  delivery  and  the 
delivety  of  neighboring  data.  In  the  ease  of  images  and  audio, 
some  devices  can  accept  out^-order  tnfoimation  in  their  buffna 
or  processing  um'u.  but  the  time  is  eonsuained  to  wnhin  one  or 
more  frames,  or  fractions  thereof. 

7.  Timing  reconstruction.  For  those  applications  where  aaet 
timing  relationships  must  be  iceonstiueted  from  a  miaed  dmn 
stream,  the  transpon  header  and  the  main  header's  tWiftipww 
would  communicate  to  provide  dming  reconstruoion  mfaonadoo. 

8.  Reserved  for  future  use. 

9.  Pad.  There  is  a  pad  field  at  the  end  of  the  transpon  header 
in  order  to  make  the  length  of  the  uaispon  header  plus  the  auhi 
header,  its  descriptor  and.  optionalty.  iu  payload,  come  out  to  a 
length  appropriate  for  the  error^ofrection  proteoioo  fawais 
supporud  in  item  2. 

Tlie  construnion  of  the  transpon  header  involves  a  stria  or¬ 
dering  of  fields  as  above,  so  that  the  sync  reinforcement  is  ahrays 
'  first  the  error  protection  a  always  second,  etc  In  this  way.  IfcaA 
field  is  present  iu  location  is  easily  determined.  The  efrDr«pr» 
tenion  field  will  always  be  in  a  known  location,  and  a  “leope  of 
protenion'*  within  this  field  defines  those  fields  that  are  protect¬ 
ed  in  both  the  transpon  header  as  well  as  in  the  main  header aad 
iu  desenptor. 

A.4  Redundancy  Transport  Header 

The  redundancy  transpon  header  provides  a  special  (unedou 
for  error  protection  against  burst  errors  on  the  dau  sticain. 
Improved  error  proieeiioa  apinst  bum  errors  is  achieved  via 
redundant  copies  of  future  and  previous  headers  (these  transpon 
headers  providing  redundancy  will  not  be  bound  to  a  main  head¬ 
er.  and  therefore  represent  an  exception  to  the  binding  propeiy 
of  tnnspon  headeis). 

Such  special  redundancy  transpon  headers  will  ‘stand  alone* 
and  occasionally  imerspersed  in  the  dau  stream.  Tli^  wfl 
eonuin  error-protected  copies  (via  either  the  eSideni  or  inefli- 
cieni  method)  of  some  future  or  previous  header,  topther  with  a 
pointer  to  that  header,  and  a  number  indicating  how  many  head¬ 
ers  forward  or  backward  will  be  traversed  before  reaching  that 
header.  The  previous  copies  are  useful  tor  going  baekwanf 
through  a  dau  stream,  and  for  reconstructing  danupd  dan 
sireaim  on  physical  media,  such  as  disk. 

Using  a  sepante  header  key  code,  the  redundancy  transpon 
header  has  the  format  shown  in  Fig.  Al. 

The  transpon  header  key  eemains  a  aeparau  special  code 
indicating  that  this  is  a  redundancy  transpon  header  containing  a 
copy  of  a  future  or  previous  header.  This  field  could  ponibfy  be 
fouled  by  a  length  field,  indicating  how  to  skip  past  this  dupli- 
caie  header. 

The  pointer  to  the  header  being  dupUeaud  is  aiulegous  to  a 
lengt  h  field,  but  it  poinu  past  several  headers  to  the  header  being 
duplicated.  It  is  a  siped  number  so  that  it  eu  referena  previoui 
as  well  as  future  headers.  Thiny-iwo  bia  of  protection  are  pfo- 
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vided  A  number  of  headers  forward  or  badcward  is  provided 
mdicating  the  iocadon  of  the  header  being  duplicated  in  number 
of  headers  rather  than  via  a  pointer  (length). 

The  maximum  and  minimum  millisecond  tolerance  Selds  indi* 
cate  the  tolerances  for  separation  times  between  this  copy  of  a 


previous  or  future  header  and  the  header  itseif.  Information 
about  separation  constraints  is  provided  by  these  fleids  for  chan¬ 
nels  that  reorder,  insen.  and  remove  da^ 

A  copy  of  the  transpon  header  is  provided  if  there  is  a  trans¬ 
port  header  on  the  header  being  copied 
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Finally,  there  is  a  copy  of  the  complete  main  header  and  its 
associated  optional  desciipior.  None  of  the  payload  is  duplicated. 

A-5  Reverse  TVansport  Header 

In  order  to  support  reverse  reading  of  the  data  stream,  a  re* 
verse  transport  header  can  be  utilized.  The  reverse  transport 
header  immediately  follows  the  main  header.  Thus  the  tnnspon 
header  is  situated  as  shown  in  Fig.  Al 
If  the  main  header  is  preceded  by  a  transpon  header,  then  the 
reverse  transpon  header  will  point  back  to  both  the  main  header 
and  the  transpon  header.  If  the  transpon  header  is  absent,  then 
the  reverse  transpon  header  will  point  back  only  to  the  main 
header,  and  the  length  backward  to  the  transpon  header  will  be 
zero. 

The  reverse  transpon  header  format  is  shown  in  Fig.  A3.  'Dus 
design  allows  these  reverse  transpon  headers  to  be  appended 
after  the  main  headers  to  allow  backward  traversal  through  the 
dau  stream. 

It  should  be  noted  that  when  redundant  transpon  headers  are 
in  use  to  protea  against  burst  errors,  reverse  tnnspon  headers 
must  follow  each  such  redundant  header.  In  that  ease,  the  length 
backward  to  the  main  header  will  be  zero,  aitd  only  the  length 
backward  to  the  transpon  header  will  have  a  nonzero  value. 

Note  that  this  header  has  a  fixed  length.  Thus,  when  encoun* 
tehng  this  header  in  the  forward  direaion.  no  pointer  to  the  next 
header  in  the  form  of  a  length  field  is  required.  The  fixed  tenglh 
of  18  byres  is  predetermined  and  can  be  used  to  skip  to  the  next 
header.  Alsa  there  will  never  be  a  payload  of  dau  or  any  anach* 
mem  to  any  headers  in  the  forward  direction. 

A.6  IVaDsport  Headers  and  Device  Capture 
It  is  ifflponam  to  remember  that  ail  devices  that  can  edit  the 
dau  stream  must  preserve  the  relationship  of  the  preappended 
tnnspon  header  to  the  main  header.  Also,  the  postappended 
reverse  transpon  header  must  also  remain  attached  to  the  main 
header,  if  present.  Thus,  when  a  tnnspon  header  is  read,  the 
following  TNvo  headers  must  also  be  read  before  assuming  the 
header  and  its  dau  and  transpon  have  been  passed. 

When  opturing  a  new  dau  stream,  it  is  necessary  to  read  at 
least  two  headers  to  determine  if  the  first  main  header  is  valid.  If 
it  had  been  preceded  by  a  tnnspon  header  that  encoded  the 
main  header's  desciipior  and/or  payload,  then  the  dau  will  not  be 
readable  without  the  tnnspon  header.  Thus,  capture  is  not 
achieved  until  the  second  header  has  been  read,  which  would 
accomplish  the  determination  of  the  first  valid  header  and  its 
tnnspon  header,  if  present 

A.7  Transport  Header  Format 
The  tnnspon  header  format  is  shown  in  Fig.  A4. 

A.ZJ  im  tht  Mmdtr  Kff 

The  transpon  header  begins  with  a  normal  header  "key.*’  eon* 
listing  of  1  byte  of  key  information  and  1  byte  of  protection,  as  for 
the  main  header.  H^ever.  the  transpon  header  uses  a  header 
key  code  reserved  speafieaily  for  the  transpon  header.  The  trans* 
pon  header  uii  differs  from  the  main  header  tail  and  has  the 
format  outlined  in  IHg.  A4. 

A.FJ  I^ngrA  »  Nta  Htadtr  tfur  Maim  Mamdtr 

The  next  field  is  a  32  bit  length  field,  which  peina  to  the  next 
header  after  the  main  header  attached  to  this  tnnspon  header. 
This  length  may  be  required  to  allow  skipping  past  the  main 
header  and  its  ^u  payload.  This  will  be  required  by  thote  de* 
vices  that  cannot  provide  error-proteaion  decoding,  when  the 
main  header  is  proteaed  using  the  types  of  error  codes  that 
scramble  the  main  header.  This  length  is  proteaed  by  32  addi* 
tionai  preteaion  bitt  to  provide  reasonably  robust  bit*erTor  cor* 
reaion.  using  the  type  of  correaion  that  does  not  scramble  the  32 


bits  of  length  (e.g..  Hamming  code). 

A.7J  Transport  Htedtr  Ltnph 

This  field  indicates  the  length  of  the  transpon  header.  It  there* 
fore  forms  the  mechanism  to  skip  to  the  main  header,  if  there  is 
no  desire  to  read  any  of  the  transpon  fields,  and  if  the  main 
header  is  not  scrambled  due  to  error  proteaion,  ene^tion,  or 
other  operations  from  the  transpon  fields.  T^  field  is  also  pr^ 
tened  by  a  32*bii  field,  using  a  aonscrambling  eiTor*prottctioa 
code. 

A.7.4  T^fpt  FiMs 

There  are  eight  4iWt  type  fields,  to  allow  16  types  for  each  of 
the  eight  fields  in  the  transpon  header.  These  fields  are  (1)  syne. 
(2)  error  proteaion  and  correction.  (3)  {niority  and  authorteadoB 
and  bidding  for  priority,  (4)  autborization  for  dau  use.  (5)  en* 
cryption  proteaion.  (6)  sequence  number  and  out^<oider  dm* 
ing  margins.  (7)  timing  reconstruction  information,  and  (8)  a  final 
field  reserved  for  ftiture  use.  The  32  bits  of  type  fields  are  protea* 
ed  by  32  bits  of  protecnoa/oorreoion  code. 

A.7J  Syne 

The  first  traiupon  operation  field  is  syne  reinforcement.  Set* 
uen  types  of  codes  are  possible,  each  with  a  peimanentiy  as* 
signed  length.  These  tv]^  and  lengths  must  be  permanently 
assigned  from  the  beginning.  A  possible  set  of  IdasdgnraenafBr 
lengths  might  be  as  follows. 

Two  types  of  sync  codes  could  be  used,  whh  custom’desifned 
unique  spearal  signatures.  Each  of  the  two  types  of  eodes  could 
have  one  of  the  following  lengths: 

1  4. 116. 32.  64. 121  and  2S6  bytes 

This  would  result  in  a  touJ  of  14  syne  reinforcement  patterai. 
Syne  type  0  would  indicate  an  absence  of  the  sync  reinforcenem 
field.  Sync  type  15  could  represent  an  additional  fecial  qmc 
field,  with  a  specified  length. 

It  is  necessary  for  all  sync  type  lengths  to  be  specified  in  ad* 
vanee.  Although  the  eodes  for  syne  themselves  can  be  spcdlicd 
later,  they  can  only  be  specified  once  for  each  of  the  15  valid  aypc 

VPes- 

It  should  be  noted  that  the  transport  header  always  begins 
exaaly  26  bytes  prior  to  the  first  byte  of  the  syne  reinforcement 
field.  Once  the  syne  reinforcement  field  has  been  located,  the 
transpon  header  and  the  main  header  have  been  located. 

A.7.6  Error  Protection 

None  of  the  fields  previously  described,  whkh  precede  the 
crror-eorreaioivproteaion  field,  will  be  proteaed  by  this  fieldL 
However,  all  of  the  fields  foUowing  and  including  this  error* 
proteoion/eonreoion  field,  staning  at  the  first  bit.  will  be  pan  of 
the  erTor*proteaion/eorreaion  group.  The  protection  will  exmnd 
through  the  rest  of  the  transpon  header  and  on  into  the  mam 
header  and  hs  descriptor,  and  for  some  of  the  formats,  into  the 
peyload  as  well 

Since  syne  reinforeemem  h  loed  to  capture  the  dau  sieam 
teitially.  h  is  probably  not  appropriate  to  error*protea  this  qmc: 
The  syne  eodes  are  designed  to  be  found  within  a  stream,  tf 
protection  is  needed  for  the  syne  field,  then  special  syne  pretcc* 
tien  cu  be  provided  within  the  ample  byte  available  for  qme 

Two  types  of  error  proteaion  are  supponed.  One  type  allows 
the  proteaed  dau  fields  to  be  read,  as  ii  ksv^  them  "dear"  but 
augmenting  them  with  proteaion.  Examples  of  this  type  of  code 
are  the  Hamming  code  and  some  common  forms  of  the  Read* 
Solomon  code.  The  second  type  scrambles  all  of  ibe  bits  in  the 
coding  process.  An  example  of  this  type  of  code  is  a  convolutk^ 
one  such  as  the  Viterbi  code.  In  the  ease  of  scrambled  proteoioo. 
the  fields  being  proteaed  will  be  completely  unreadable  withoat 
decoding.  If  the  header  fonnat.  the  header  length  field,  ite  d^ 
sexiptor  type,  and  many  other  erudal  fields  are  proteaed  in  this 
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way.  those  devices  that  cannot  decode  the  error  correnion  would 
be  unable  to  either  read  or  skip  the  header.  Thus,  the  transport 
header  will  contain  a  'lenph  Held.'  protected  in  the  augtnenu* 
method  rather  than  the  scrambling  method,  which  will  allow 
devices  to  find  this  length  field  and  thereby  skip  the  rest  of  the 
transport  header  and  the  entire  main  header,  its  descriptor,  and 
its  payload.  A  second  "length  of  transport  header"  field  will  also 
be  present  and  proteaed  via  the  "in  the  dear"  augmentation 
method. 

A.7.6.J  Imenxional  InfledbilUy 
In  order  for  ail  of  the  above  mechanisms  to  operate  properly 
and  eflicientiy.  it  is  necessary  to  limit  the  number  of  fbtmis 
•vaiiabie  in  the  transport  header.  Perhaps  8  or  16  types  of  each  of 
the  fields  should  be  provided  for.  with  all  types  being  speaafied  in 
advance.  We  are  us^  16  as  our  example.  Thus,  there  would  be 
16  enor<<errecting  coded  formats  and  {.  code  lengths,  16  sync 
reinfereement  field  formau  and  lengths,  16  priority  fields  and 
lengths,  etc,  with  each  field  format  having  a  specified  length. 

Snee  the  error  eorrecaon  transport  ftinetioo  is  the  most  diffi* 
cuh  with  respect  to  format,  a  veiy  limited  number  of  field  options 
will  be  provided  under  the  sco^  of  protection.  Also,  padding, 
which  might  be  quite  long,  will  be  required  at  the  end  of  the 
transport  header  before  the  main  header  in  order  to  make  the 
total  length  of  the  transport  header  plus  the  main  header  and  its 
descriptor  (and  possibly  payload)  to  be  a  simple>io-eorrect  con¬ 
venient  known  length  oorropondlng  to  one  of  the  16  protection 
types. 

/L7.6.2  Possible  Prespeeified  Proteaien  Types 

'  The  prespecified  16  possible  protection  types  might  be  as 

follows: 

•  Type  0  indicates  that  no  procection/corTection  field  is 
present 

•  Types  I  through  10  protect  "in  the  dear."  by  adding  protec¬ 
tion  bits  without  scrambling,  as  in  Hamming  codes  and  some 
common  types  of  Reed-Solomon  coding. 

•  Types  1  through  5  protect  the  transport  header,  the  main 
header,  and  its  descriptor. 

The  types  are  as  follows: 

1.  Protect  all  remaining  transpon  header  bits,  beginning  at  the 
first  bit  of  the  erTor-proteetiotveorreerion  field,  ail  main  header 
bits,  and  all  descriptor  bits.  Oo  not  protect  any  payload  bits.  The 
proteetton/correnion  bits  are  ^piied  on  every  group  of  64  bits. 
The  total  length  of  ail  fields  being  protected,  exduding  the  error- 
code  bits  themselves,  must  be  a  muitipie  of  64  bits.  This  is  aeeom- 
pitshed  by  the  use  of  pad  bits  at  the  end  of  the  transpon  header. 

This  implemenution  requires  that  memoty  be  available  to 
store  the  correaion  biu  for  the  entire  length  of  transpon  header, 
main  header,  and  its  descriptor. 

For  type  1.  the  total  length  of  the  protection  code  field  is  the 
total  iea^  over  16.  with  4  bits  for  every  64  (68  bhs  total  00  64  bitt 
of  d&ti). 

1  Same  as  1.  but  total  length  over  8.  with  8  bits  protecting 
every  64  (73  biu  total  for  64  bitt  of  dau). 

3. ’ Same  as  1.  but  total  length  over  4,  with  16  bits  protecting 
every  64  (80  bhs  toul  for  64  bits  of  data). 

4. '  Same  as  1.  but  total  length  over  3  (half  as  long  as  the  fields 
being  protected);  32  bits  proiecung  every  64  (96  bus  total  for  64 
bits  of  dau). 

5.  Same  as  1.  but  total  length  over  2  (the  same  length  as  the 
fields  being  protected):  64  bits  proteaing  cveiy  64  (128  bits  total 
for  64  bitt  of  data). 

Types  6  through  10  protea  the  paytoad  in  addition  to  the 
tnii^n  header,  the  main  header,  and  its  descriptor. 

6.  Same  as  type  1.  except  pmtea  the  payload  as  well  For  type 
6.  the  total  length  of  the  protecion  code  field  is  the  total  length 
over  16.  with  4  bhs  pioteaing  every  64  (68  bits  total  64  bits  of 
data). 

7.  Same  as  6.  but  total  length  over  8.  with  8  bits  ptoieciing 


every  64  (72  bits  total  tor  64  bits  ot  dau). 

8.  Same  as  6.  but  total  length  ov  er  4.  with  16  bits  protecting 
every  64  (80  bits  toul  for  o4  bits  ot  ^ta). 

9.  Same  as  6  but  total  length  over  3  (half  as  long  as  the  fields 
being  proteoed);  32  bits  protecting  every  64  (96  bits  toul  for  64 
bits  of  daui. 

10.  Same  as  6.  but  toul  length  over  2  (the  same  length  as  the 
fields  being  proteaed):  64  bits  proteaing  every  64  (128  bis  total 
for  64  bits  ot  dauk 

Types  11  through  15  protea  by  se^bling,  u  in  convohwioii 
oodhg  techniques  such  as  Viterbi  coding.  Typa  11  and  12  protea 
the  transport  header,  the  main  header,  and  hs  descriptor. 

11.  Protea  ail  remaining  transpon  header  bits,  begmnini  at 
the  first  bit  of  the  proteetion/correaion  field,  all  main  header  bio. 
and  ail  descriptor  bits.  Do  not  proiea  any  payload  bits.  The 
proteeiion  is  appUed  using  16  bytes  on  every  poup  of  144  bytes. 
The  total  length  of  lU  fields  being  protecud.  inciuding  the  error- 
code  bits  themselves,  must  be  a  multiple  of  144  byus.  This  ii 
accomplished  by  the  use  of  pad  bits  at  the  end  of  the  canipoft 
header. 

This  implemenution  requires  that  memory  be  available  to 
store  the  correaion  bytes  for  the  entire  length  of  tnnspon  head¬ 
er,  main  header,  and  its  descriptor.  For  type  1 1,  the  total  length  of 
the  proteaion  code  is  16  bytes  for  every  144.  Thus,  there  are  16 
extra  proteaion  bytes  in  each  144  bytes  being  stored,  muhiag  in 
128  bytes  of  dau  aher  decoding. 

12.  Same  as  11.  but  with  16  bytes  proteaing  every  80  bytes, 
resulting  in  64  bytes  of  dau  aher  decoding.  The  toul  of  all 
lengths.*  beginning  at  the  tlrst  bit  of  the  proteaioivconectioa 
field  must  be  a  muitipie  of  80  bytes. 

Types  13  and  14  protea  the  payload  in  addition  to  the  trans¬ 
pon  header,  the  main  header,  and  its  descriptor. 

13.  Same  as  11.  except  the  payload  is  also  proteaed. 

14.  Same  as  12.  except  the  payload  is  also  protected. 

15.  Same  as  11.  except  4  bytes  protea  32  bytes  (toul  of  36 
bytes  for  32  bytes  of  dau).  This  format  is  for  shon  header  for¬ 
mats.  and  prmndes  oo  payload  proteaioa 

A.7.6.3  Interieoving 

In  addition  to  the  above  mechanisms  for  error  protection,  wnc 
of  the  error-proteaion  formats  can  invoke  interleaving.  Imertem^ 
mg  can  substantially  reduce  the  problems  associated  with  bunt 
errors.  Although  the  initial  pan  of  the  transpon  header  is  subjea 
to  being  "wiped  out"  by  a  burst  error,  presumably  a  copy  of  thit 
seaion  could  be  available  prevfously  in  a  redundancy  transpon 
header.  Thus,  once  the  proteaion  format  has  been  detennined. 
then  the  rest  of  the  transpon  header,  beginning  at  the  enor- 
proteaion  field,  plus  the  main  header,  its  descriptor,  if  ptesenc 
and  optionally  the  dau  payload,  can  all  be  protecud  froin  bum 
errors  via  interleaving  in  ^ition  to  error<orreaion  uethetk 

Predefined  interleaving  methods  cut  be  mcorporaied  with 
some  of  the  types  disoisud  above.  Becauu  interleaving  is  Skeiy 
to  invoive  as  wide  e  spacing  as  is  feasible,  there  will  be  a  tradeoff 
between  the  length  of  the  proteaed  field,  and  natural  uultipies 
of  the  error-protection  sizes.  The  error-protection  group  sta  for 
Hammin|-(ype  cedes  are  much  smaller  than  the  uiui  protretion 
group  sizes  for  Reed-Solomen-type  codes.  Interleavini  must  be 
some  multiple  larger  again.  Thus,  useful  interleaving  nay  be 
restriaed  to  longer  lengths  of  fields  being  proteaed.  One  poni- 
biliiy  is  to  have  the  interleaving  spacing  be  the  enor-pioweiiaa 
group  size  divided  into  the  toul  length. 

However,  this  near<cptimal  format  requires  some  eompleaiiy 
in  unwinding  the  inicrieavin|.  For  long  proteaed  fields,  thh  may 
also  involve  a  buffer  that  is  the  length  of  the  field.  Thus,  there  are 
potential  issues  to  investigate  with  respea  to  how  to  univcRally 
and  generally  specify  a  powerful  interieaving  technique. 

A.7.6.d  Error  Dettaien 

There  is  no  proviston  for  simple  error  deteaion  in  the  above 
type  examples.  Such  deteaion  could  be  provided  via  tydkal 
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redundancy  code  (CRC).  5re  code,  checksum,  pahty,  or  other 
check  method.  Such  may  be  jaefui  in  some  eases.  However,  the 
focus  on  enor  correction  is  tSsed  on  the  need  for  headers  to  be 
interpreted  without  error  in  order  to  serve  their  fonoion. 

The  descriptor  in  the  main  header  can  be  used  for  detection 
codes  for  dau  payloads  that  should  be  checked  but  need  not  be 
corrected.  This  need  not  be  standard<speeifie.  sinee  the  deschp* 
tor  can  be  standard>independent.  Thus,  error  detection,  as  op* 
posed  to  correnion.  is  more  appropriate  in  the  dotriptor  than  in 
the  transport  header. 

A.7.6J  Panmeurs  of  Emr Prouaion 

The  parameters  of  error  detection  shMm  in  the  above  type 


T^rpeO.  I  byte 


I  Pzleriey  I 
I  I  bita  I 


Type  1,  2  byeea: 


I  Pziority  I 
I  1C  bita  I 


Type  2.  4  byres: 


I  Pzierity  I  Autherizatlen  l 
I  1C  bita  I  1C  bits  I 


Type  3»  ■  bytea: 


I  Priority  I  Bid/Value  I  Authorization  I 
I  1C  bita  I  1C  bits  I  32  bits  I 


Type  4,  1C  bytes: 


I  Priority  I  bid/VaXue  I  Authexizatien  | 
I  4  libyteal  4  bytes  I  I  bytes  I 


Type  S,  32  bytes: 


I  Priority  I  lid/Value  I  Antherizatien  i 
I  4  bytes  I  4  bytes  |  24  bytes  | 


Type  C,  C4  bytes: 


I  Priority  1  Bid/Value  I  Aatberiaatien  | 
I  •  bytes  I  •  bytes  I  4C  bytss  U  I 


Type  7,  121  bytes: 


I  Priority  I  Bld/Value  |  Autberisatien  | 
I  I  bytss  I  ■  bytss  I  112  bytss  I 


etc. 


examples  need  further  investigation  and  refinement  The  lengths 
and  protection  ratios  proposed  are  known  to  be  implementable  in 
existing  hardware  and  are  expected  to  be  eorrvenient  in  pranice. 
However,  further  investigation  of  optimal  parameters  for  error 
protection  may  help  refine  or  revise  the  parameters  suggested 
above. 

A.7,7  Priority  and  Aiuhorization  or  Bid  for  Priority 

Finite  bandwidth  resources,  such  as  satellite  channels,  long 
fiber  channels,  long  reai*time  computer  channels,  terrestrial 
broadcast  channels,  and  other  channels  with  long  disunees  cause 
long  latency,  which  naturally  prevents  error>retiy.  Thus,  channel 
bandwidth  allocation  near  saturation  on  teaJ*tiiBe  inageiy 
streams  takes  the  form  of  packet  collisions.  Such  packets  are  most 
naturally  the  header/desoriptor/payioad  eombinatioiL  since  each 
can  have  its  own  priority  and  forms  a  eonsunt  priority 
pouping.  The  eonsunt  priority  grouping  would  be  the  construc¬ 
tion  used  by  the  origirutor. 

When  sharing  a  fi^te*bandwidth  channel  it  may  be  neoesary 
to  pass  some  dau  and  drop  other  data.  In  order  for  the  channel’s 
eontrolling  device  to  determine  fairly  which  packes  to  pass  and 
which  to  drop,  priorities  for  paekeu  might  be  provided.  In  many 
spaual*frequency*based  compressed  imagery  formats  such  as 
DCT.  subband,  and  wavelets,  the  high  fiequendes  represent  tiny 
picture  deuii  that  might  be  lost  without  much  picture  degrada¬ 
tion.  However,  the  ^tiaJ  low  fiequeneies.  audia  and  motiem 
veaors  must  be  heavily  protected  and  may  not  be  dropped  with¬ 
out  visible  artifacts. 

The  type  field  would  specify  the  length  and  format  of  the 
priority  and/or  authorization  fields  that  follow.  The  length  might 
have  2*type  length  (2  to  the  power  of  the  value  in  the  type  field, 
being  1 4. 8. 16. 32  64, 128.  etc.,  bytes}.  Type  0  still  repiwntt  the 
absence  of  the  priority  field. 

Since  the  priority  and  their  authorization  fields  will  oompeu  at 
the  highest  level  it  will  be  necessary  for  us  to  define  their  mean¬ 
ing  at  the  outset.  We  will  further  need  to  define  the  mappings 
between  the  shorter  and  longer  versions  of  each  field. 

The  format  of  the  fields  might  be  as  shown  in  Rg.  AS.  The 
priority  field  varies  from  1  to  8  bytes,  allowing  very  detailed 
priority  levels. 

The  Bid/Value  field  allows  a  packet  to  have  a  “bidding  price'  in 
a  collision  with  other  packets.  Such  a  bidding  price  would  be  a 
value  if  the  price  had  previously  been  accepted.  A  value  would 
imply  that  tossing  the  packet  would  break  a  eontraa  for  deiivety 
to  the  packet.  The  meanings  of  the  Bid/VaJue  fields  would  be 
tied  directly  to  authorization  codes,  which  vimuid  indicate  the 
following: 

1.  Whether  the  header  was  authorized  to  bid. 

2  The  ’‘credit  rating”  (or  importance)  of  the  bidder.  This 
could  potentially  weight  the  priority  field. 

3.  Whether  the  bid  had  bem  previously  accepted,  so  that  the 
Bid/Vaiue  field  was  the  value  paid  for  the  payload's  deiivety.  In 
such  a  case,  losing  the  packet  would  vtoiate  the  eontracL  Pre¬ 
sumably  such  a  case  would  occur  only  when  more  contracts  had 
been  nude  tlun  were  availabie.  so  that  packets  were  only  tosed 
by  other  similar  accepted-bid  packets  with  an  established  value. 
This  is  a  similar  problem  to  "overbooking"  on  airlines. 

4.  Authorization  may  affect  the  bidfSraiue.  If  eommisions  are 
paid  on  some  bids  or  values,  and  not  on  others,  the  net  bid  or 
value  may  differ.  This  is  similar  to  the  problem  of  bids  in  different 
currencies,  or  direa  bookings  versus  using  agents.  Thus  authori¬ 
zation  can  indicate  the  source  and/or  type  of  a  bid  for  these 
purposes. 

The  priority  should  be  registered  in  entirety.  Thus,  the  mean¬ 
ing  of  prioriiy  codes  might  be  ^fined  by  registration.  However,  it 
may  be  desirable  to  have  priorities  uke  simple  linear  precedence 
order,  with  higher  values  representing  higher  priorities.  One  pos- 
rible  solution  is  to  define  the  first,  or  the  first  and  second  bytes  of 
the  priority  to  be  linear  magnitude  precedence  priority  codes. 
Subsequent  bytes,  however,  could  be  registered  eedn.  with 


figure  A5.  ficiO  totmea. 
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unique  meanings  that  are  standardized  to  help  resolve  priority 
conflicts. 

Other  than  the  potential  interaaions  of  authorization  on  prior* 
iiy  and  hid^'ilue.  authorization  can  have  the  follt^ving  very  im* 
ponant  uses. 

A.7.i  7Ju  Auikoruetion  Fitld 
Uses  of  authorizadon: 

1.  Pay*pcr*view  target  encryption  codes  (in  lock  step  with  re¬ 
ceiving  system). 

1  Qtannel  authorization.  For  eaatnple.  is  a  satellite  downlink 
dtannei  signal  authorized  for  use  as  a  cable  head-end? 

3.  Oiannel  routing  authorizations.  For  tnstanee,  are  aH  au¬ 
thorized  destinations  only  on  network  fork  A.  such  that  a  source 
for  subnetworks  A  and  B  need  not  carry  the  paytoad  to  sub- 

netuork  B.  Tills  is  the  function  of  suppottiag  a  subset  of  all  of  the 

outputs  involved  in  a  y  connection. 

4.  End-user  authorization  for  teleconferencing,  to  if»d?‘raig 

who  can  be  included  in  the  teleconference.  who  may 

observe  and  who  may  originate. 

5.  Privacy  and  protection  against  unauthorized  acceptance  or 
oripnation  of  the  signal  in  any  use.  For  eaatnple.  protection 
against  real-time  datastream  hackers  or  unauthorized  vido 
phone  wire  apping. 

6.  Authorized  user  enablement  codes.  Such  codes  would  au¬ 
thorize  user  systems  for  future  authorized  codes.  For  eaample, 
when  a  cable  subscriber  adds  a  new  channel,  an  enablement  code 
would  be  sent  to  the  decoder  to  add  authorization  interpreution 
and  viewing  for  the  new  channel. 

7.  Diagnostic,  statistic,  and  rating  codes  for  exploring  network 
loads,  active  users,  show  ratings,  unintentional  network  discon¬ 
nects.  channel  error  rates,  etc. 

8.  Copyright  information  indicating  ownership. 

9.  Qsp^ght  fee  structures,  induding  where  to  pay  fees. 

10.  Indications  of  who  may  edit  a  work,  whether  it  may  be 
induded  in  other  works,  and  fee  structures  for  doing  so. 

11.  Possibly  an  automated  mechanism  could  be  constructed  to 
automatically  negotiate  rights  based  on  prearranged  *Nrilling  to 
pay'  algorithms,  so  that  dips  can  be  induded  without  undue 
complication. 

A.7.9  EiteryptwM 

One  fonaion  of  the  transport  header  is  to  provide  one  or  more 
encryption  keys  for  deciphering  the  payload  and  descriptor. 
Many  protected  users  may  wish  to  proien  against  unauthorized 
deciphering  of  the  descriptor,  since  it  may  eonuin  valuable  infor¬ 
mation  that  could  help  in  dedphering  the  payload.  Codes  could 
be  used  for  encryption  keys,  for  example,  to  unlock  descriptors. 
Descriptors,  in  turn,  may  eonuin  more  daborate  encryption 
codes  to  further  unlock  the  payload. 

Based  upon  successful  authorization  code  interactions,  ertciyp- 
tion  codes  can  be  deciphered  and  applied  against  the  dcseripior, 
the  payload,  or  both.  As  usual  a  type  0  represenu  that  no  encryp¬ 
tion  field  is  present.  Fifteen  types  are  available,  with  15  presp^ 
had  associated  lengths,  for  encryption.  Although  the  len^  must 
be  prespecified.  the  meaning  of  the  encryption,  or  iu  associated 
technique,  can  be  compleufy  private.  Complex  encryption  algo 
rithitts  can  be  developed  and  up^ted  between  embedded  cedes 
in  the  receiving  device,  codes  in  the  descriptor,  and  possibly 
algorithms  transmitted  and  updated  via  deseriptots. 

A,7J0  Sefuener  Nmmbm 

The  sequence  numbering  field  spedfies  not  only  packet  order¬ 
ing,  but  also  windows  of  order  and  groupings.  For  example,  in 
seme  systems  various  headen  and  their  associated  payloads  form 
packets  that  can  update  the  screen  in  any  order  during  the  frame 
titne  before  the  buffer  switch  for  viewing  Hmvever.  notion  vec¬ 
tors  might  need  to  precede  compressed  image  deltas.  Thus  not 
only  packet  grouping,  but  packet  general  ordering  night  be 
specified. 


On  some  networks,  lower-pnoriiy  packets  are  delayed,  rather 
than  dropped.  In  such  networks,  it  is  necessary  for  the  network 
controlling  mechanisms  to  understand  the  bounds  on  acceptable 
delay  for  packets  and  groups  oi  packets  This  field  contains  a 
series  of  codes  for  defining  tolerance  of  irrugery  and  other  real- 
tine  streams  for  being  received  out  of  order.  Type  0  neant  no 
sequence  field  is  presenL  Fifteen  valid  type  fields  with  the  anori- 
ated  prespeufied  lengths  are  available. 

A.7.II  Tlmuif  JbemnweeiM 

In  real-thne  dau  soeans,  it  is  often  necessary  to  reeonsmicl 
mesM  timing  after  this  timing  is  disupted  during  oanspon. 
Tbntng  reconseuerion  informatien.  ceneemini  the  tnws  at 
which  evens  should  occur,  are  specified  in  this  field.  Times  can 
be  specified  as  absolute  tunes,  where  the  transport  delays  and 
their  bounds  are  taiown.  Relative  tuna  can  be  specified  relative 
to  an  aitisaiy  “Stan  of  real-time  stream”  dock  marker,  wbiefa  ii 
set  by  the  reeeivtng  device  upon  receiving  the  fim  diiplqsble 
buffer  load. 

Synehrmiznioa  between  audio  and  imag^  between  nuhiple 
audio  streams,  or  between  streams  ftom  multiple  sources,  is  h^ 
died  via  the  timing  reconstruction  field.  Resynehronizatioo  for 
remwing  eumuiaove  jitter  effees  can  also  be  enabled  tbnmh 
the  use  of  this  field. 

A  type  of  0  indieates  an  absence  of  the  tuning  tcemmiaedea 
field.  The  15  available  codes  will  have  prespecified  iengths,  al¬ 
though  their  timing  meaninp  may  be  deferred  from  some  of  the 
types.  Of  course,  each  of  the  types  can  only  receive  a  ai^e 
meaning,  which  meaning  must  stay  in  place  from  then  on.  The 
lengths  for  such  unspediSed  codes  must  ail  be  specified  in  ad¬ 
vance.  however. 

A.7.22  Tbamtdfat  tha  Fwatn 

This  field  is  unspecified  in  content  and  length.  Baciuse  the 
total  length  of  the  transport  header  is  known,  and  because  this  ia 
the  last  field  in  the  header  prior  to  the  pad.  this  field  ^  cu 
maintain  flexibility  for  future  use  by  remaining  completely  unspe¬ 
cified.  All  other  fields  must  at  leas  have  their  lengths  specified 
for  each  type  value. 

A. 7.J3  FmdBia 

Pad  bits  make  the  lengths  simple  for  error-eorreetion  ptoceai- 
ing  This  is  accomplished  by  making  the  total  of  the  ertor-cotrec- 
tion/proteetion  field  range  be  the  appropriate  length  for  dm 
error-correction  format  being  used.  For  example,  using  a  «  bis 
length  type,  the  length  after  the  error-protecoowcorreoioD  fisdd 
mus  be  a  mul^le  of  64  bits.  Thus  if  the  scope  of  the  proteedoa 
tnciudes  additionai  ttanspon  fields,  such  as  priority  and  enoy^ 
lion,  plus  a  header  and  ia  descriptor,  the  pad  bia  would  make  the 
sum  a  proper  multipie  of  64  bits. 

Appendix  B  —  Ulustrativt  Ezasnplcs  of  Header 
Deling  Using 

B. 0  Badtgroand 

It  IS  often  instructive  to  represent  a  design  as  a  computer 
program  written  in  some  appropriate  language  (in  this  ease  Q.  b 
verifies  the  design  and  provides  a  basis  for  comparing  the  eon  and 
performance  of  design  alternatives.  The  C  language  was  chosen 
because  it  is  reasonably  universal.  Conciseness  and  eonsisteney 
am  foremost  eoRstdetations  in  enabling  comprehension  and  fob 
eofflparison:  epiimai  performance  is  of  secondary  hnponanee. 
Optimizations  and  enhancemena  would  be  added  in  preparatioo 
for  commercial  distribution. 

Two  programs  are  described.  One  decodes  a  compyt  header 
and  one  decodes  an  ASN.l  header.  They  are  similar  in  apptar 
ance  and  use  the  same  basic  steps.  The  primary  difference  is 
compact  header  decoder  seieca  between  muit^le  formaa  using 
table  lookups.  whUe  an  ASN.l  header  has  only  one  wtmiible 
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Ibnnat.  Each  prograni  exiracu  the  block  length  and  sundard 
idennrter  from  the  header,  and  then  eaiu  a  corresponding  func¬ 
tion  to  process  the  payload. 

B.1  Compact  Header  Decoder 

The  follOMring  program  decodes  a  packet  with  a  eompaa  head¬ 
er.  If  the  packet  fonnat  is  predefined,  it  calls  the  corresponding 
predefined  function,  otherwise  it  eziracis  the  standard  identifier 
and  block  length  in  a  manner  similar  to  the  algorithm  described  in 
Section  It  uses  the  identifier  to  look  up  a  decoding  funaion 
(/),  and  ignores  any  blocks  with  unknown  ittentifen. 

Two  ubie  lookups  are  used  to  decode  the  compact  header. 
The  length-type  table  (It  table)  contains  infonnation  used  to 
decode  predefined  messages  and  block  length.  The  identifier 
^le  (id  table)  contains  information  used  to  decode  the  standard 
identifier.  One  ohrious  opdmiation  is  to  combine  the  two  table 
lookups  into  a  singie  2S6-entry  lookup.  This  reduces  the  instruc¬ 
tion  path  for  some  cases,  but  increases  memory  requirements. 

The  idemifier  is  left  as  a  string  of  bytes  used  to  compute  a  hash 
table  lookup  of  a  decoding  fusion.  If  a  sovereign  sute  field 
exists,  it  is  processed  together  with  the  standard  identifier,  but  a 
separate  hash  table  is  used.  The  hash  table  lookup  is  performed 
by  the  procedure  lookupO.  which  takes  identifier  address,  identi¬ 
fier  length,  and  ubIe  selection  as  aif  uments.  and  mtunu  a  point¬ 
er  to  the  corresponding  function  (/). 


*  i id  byte 

*  p  ::  payload  byte 

■ 

•/ 

extern  void  fakcO:  *fake  predefined  func  to  init  It  ubie*/ 

extern  void  nuilO:  ’null  func  for  unused  funaion  entries*/ 


lypedef  strua  { 
char  length: 
void  (*funaion)0: 
char  id  oSaeu 
}Li  entry; 


*It  table  strunure*/ 

.'block  length  or  length  of  length  field*/ 
“predefined  funaion  for  It  0«5*/ 
/*ofiset  to  idem  field  for  b  6-IS*/ 


Lx  emiv  it  tabie(16]  •  {  '*lt  table  dedaradonV 

{2.  fake.  0).  {1  fake.  0).  {4.  fake.  0),  {5.  fake,  0).  nUV 

{6.  fake.  0}.  {a  fake.  0).  {I,  null  3).  {1.  null  4),  r4.7*/ 

{1  null  4}.  (2  nulL  5}.  {4.  nuU.  6}.  {4.  null  8).  rS-llV 

{6,  nulL  8).  {6,  nuU.  10}.  {a  nulL  0}.  {a  nulL  0}.  n2.15*/ 


lypedef  smia{ 
char  length: 
char  table: 
}Identty; 


rid  ttWe  wucnire*/ 

/■length  of  idem  field*/ 

/Svhieh  able  to  use  in  lookup*/ 


SJ./  Caaiaimn  Sous 

Certain  header  formats  are  not  yet  defined  or  are  reserved  for 
future  U.SC.  The  prognim  below  dm  not  support  these  fomuts. 

Predetined  message  types  have  not  yet  been  sundardized.  To 
make  the  code  complete  a  dummy  funaion  call  fakeO.  has  been 
used.  When  the  funaions  were  standardized,  the  It  ubie  would 
change  aceordingiy. 

Block  length  is  assumed  to  fit  within  one  33-bit  word.  Extend- 
mg  the  program  and/or  the  C  language  to  support  larger  word 
sizes,  thus  iaiger  block  lengths,  s  posible  and  likely  to  happen  as 
64-bit  processor  arehiteaurcs  emerge. 

Bit-field  ordering  and  assignment  are  not  yet  defined  Choices 
made  in  the  program  below  will  require  further  eonsideration  in 
the  eoniext  of  standardization. 

If  an  unknown  identifier  is  eneountered  the  lookup  funaion 
will  return  a  pointer  to  an  appropnaie  default  funaion  that  ig- 
fxtres  the  payload  and  displays  an  informative  message. 

Ai.3  Prograim  Tea 

The  program  has  two  parts  «•  the  first  part  eonuins  ubie  and 
procedure  declarations,  the  second  part  (at  the  end)  eonuins  the 
dozen  or  so  sutements  aaually  executed  Throughout  the  eode 
descriptive  notes  (eommenu  that  are  not  executed)  are  placed 
between  comment  delimiters  (/*.>*/). 

rCdmpaa  header  has  one  of  two  forms: 

• 

■Each  eharaoer  in  the  strings  below  represenu  a  byte:  bytes 
■between  square  btackeu  are  optional:  payload  bytes  are  not 
■eounted 

*3-byte  (minimum)  for  predefined  messages: 

■  ‘ke(p-4>r 


Idjentrv  idJable(81w{  '■id  ubie  declaration*/ 

{I.  0).  {1.  1).  {1  0}.  {3.  0}.  {1 :}.  {4.  :}.  (8.  2}.  {0.  0} 

}: 


enem  void  (*lookup0)0:  '■id  lookup  fiinaion*/ 


^EXECUTABLE  CODE  STARTS  HERE*/ 

reall  sundard  decoder  and 


int  decode  eompaa  header  (pkt) 

unsigned  char  *picc 

unsigned  char  *pwpku 
int  it: 
mt  length: 

Id  enw  *pid 
void  COO: 

lt«*p>>4; 

if(lt<6) 

{ 

length  wit  uble(It]. 
length: 

(■b  ttble(lt]iiinaion) 

(P): 

return  length; 


return  length*/ 

/■poimer  to  packet*/ 

Tworking  pointer  to  packet*/ 
/•length  type*/ 

/•block  length*/ 

/*id  table  pointer*/ 
/•standard  funaion  returned 
from  lookup*/ 

/•get  length  type  from  biu 
4>7  of  key*/ 

rproeess  predefined  for¬ 
mats:  It  <6*/ 

/■get  block  length  from  It  u- 
ble*/ 

/■eail  predefined  function*/ 


/■estrao  enough  bytes  to  cover  length  field  and  shift  out  unused 
bits*/ 


■Extended  header  fat  lotyr  Weeks: 

*  1el{_IlIe-eJil-jT 

•Key. 

*  k  z  key  byte  (length  type  and  id  type,  presence  of  a  readable 
■description) 

*  c  z  error  byte 
■  I  z  length  bye 


length ■((int)*(p4>2)<<24)  |  ((int)*(p>3)<  <16)  | 

((int)*(p'v4)<<g)  I  •(p>*'5); 
length  >>w32»(ta  iable(lt)Jength*8): 

pidw&id  ttble{*p  A  0x7]:  /■get  id  table  pv  usmg  bits  U.-2 

of  key*; 

p^  w b  tt6ie|b|Jd  ofieu  /■move  pointer  to  sun  of  id 

field*/ 

f ■  lookup! p.pid->leiigth.  •*k>okup  function*/ 

pid->uble); 
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/•  EXECUTABLE  CODE  STARTS  HERE 


p*  "pid-  >  length 
(*0(p*length  -  (int)(p-  pkt )): 

return  leneth: 

} 

BJ  ASN.l  Header  Decoder 
The  following  program  decodes  a  packet  with  an  ASN.l  head* 
er.  The  ASN.l  header  is  a  compatible  subset  of  the  standard 
ASN.l  EXTERNAL  type.  As  suted  in  Section  6.4.  not  all  the 
flexibility  of  the  standaid  EXTERNAL  type  is  needed  to  meet 
the  o^ectives.  Thus  this  decoder  has  been  specialized  to  support 
a  level  of  function  roughly  comparable  to  that  of  the  compaa 
header  decoder. 

Drroding  an  ASN.l  header  is  performed  by  parsing  a  ae* 
quence  of  tokens.  Block  length  is  defined  by  one  sequence,  stan¬ 
dard  identifieation  by  a  second.  Ute  header  has  short  or  extended 
forms.  The  shon  form  is  used  for  blocks  of  12S  bytes  or  less  and  is 
processed  by  extracting  block  length  from  a  sttigie  byte.  The 
extended  fom  is  processed  by  constructing  block  length  from 
multiple  bytes. 

The  identifier  is  left  as  a  string  of  bytes  used  to  compute  a  hash 
uble  lookup  of  a  decoding  hinction.  The  hash  table  lookup  is 
performed  by  the  procedure  lookupU.  which  ukes  identifier  ad¬ 
dress  and  length  and  returns  a  pointer  to  a  corresponding  func¬ 
tion  (/). 

B.2.1  CAuiionarj  Nota 

Context-dependent  headers  are  not  decoded  by  the  code 
below.  They  are  decoded  by  sundard  decode  procedures  at  a 
time  they  are  expeaed.  Block  length  is  assumed  to  fit  within  one 
32-bit  word.  Extending  the  program  aneUor  the  C  language  to 
suppon  larger  word  sizes,  thus  larger  block  lengths,  is  possible 
and  likely  to  happen  as  64-bit  processor  architeaures  emerge.  If 
an  unknown  identifier  is  encountered,  the  lookup  funaion  will 
return  a  pointer  to  an  appropriate  default  function  that  ignores 
the  payload  and  displays  an  informative  message. 

B  JJ  Prvgnm  Tta 

The  program  has  two  paru  >-  the  first  eonuins  variable  and 
procedure  declarations,  the  second  (at  the  end)  eonuins  the 
dozen  or  so  sutements  actually  executed.  Throughout  the  code 
desenptive  notes  (comments  tfiat  are  not  executed)  are  placed 
between  comment  delimiters  (/*.*/). 

/* ASN.l  header  has  one' of  three  forms: 

‘Each  character  in  the  strinp  below  represenu  a  byte:  bytes 
'between  square  brackets  are  optional:  payload  bytes  are  not 
'counted 

'2-byte  (minimuffi)  header  fbr  context  dependent  messages: 

•mi-ir 

'7-hvie  (minimum)  header  fbr  shon  Mocks? 

*  • 

'  -twq-i)tr 

'Extended  header  for  fonger  Mocks: 

'  -tH-J|tlil-.i)U|_ir 

•Key: 

*  I  :  ug  byte 

*  I  s  length  byte 

*  i  X  kl  byte 


/'move  pointer  to  lurt  of  pay- 
load*/ 

/'call  function  with  payload 
ptrJength'/ 


decode_asn  IJicadeit  pkt ) 

unsigned  char  *pkt: 

extern  void  (('Iookup0)0: 
unsigned  char  'pwpkc 
unsigned  int  lei^ith: 
void  COO; 

unsigned  int  t: 

length  ■'(pwi): 

if  (length  <127) 

^-•(p-r3): 

fwiooicup(p+44): 

(•0(P'rt+6Jenfth- 

(p-t-t-t-6-pkt)); 

return  length 't' 2: 


'call  standard  decoder  and 
return  length'/ 

.'pointer  to  packet'/ 

."id  lookup  function*/ 
/^•«rking  pointer  to  pnefcet* 
rbioek  length'/ 

/"standard  fiutetion  renimed 
from  lookup*/ 
rtemp  repster*/ 

rget  Is  length  byte*/ 
riook  for  shon  krmV 

/*gei  id  field  length*/ 
riookup  function*/ 

fcall  function*/ 

/*retum  total  length*/ 


t*  length -128: 


/*caic  length  of  length 


/"extract  enough  bvies  to  cover  length  field  and  shift  ooij 
biu'/ 


lenph«((int)'(p-r2)<<24)  |  ((int)'(p-^3)<  <16)  | 

((im)'(pw4)<<8)|'(p<.5 


lcngth>>»32-(t'8): 

p^  «t+3: 

falookup(p*  l.'p); 
p^  •  *p+'t^3: 

COfpJenph  -  (int )(p-  pkt)): 

return  length  1-^2: 

} 


Tmove  pointer  to  san  of  U 
field'/ 

rioekup  fonoion*/ 

/"move  pointer  to  san  of 
payload*/ 

/*^1  fune  (payload  pir  aid 
size)*/ 

/"return  total  length*/ 
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Preface 

For  some  time,  the  communities  pnnicipoiing  in  the  sun* 
dorJizaiion  activities  of  the  Society  of  Motion  Picture 
and  Television  Engineers  have  considered  the  role  of 
television  in  the  future  of  visual  communications.  In 
recent  years,  the  debate  h.TS  been  joined  by  members  of 
other  communities  affected  by  the  convergence  of  com¬ 
munication  and  imaging  technologies  enabled  by  a  com¬ 
mon  set  of  digiul  building  blocks. 

The  emergence  of  digital  coding  as  the  common  bn- 
guage  of  visual  communications  may  fundamentally 
change  our  view  of  the  world.  The  extent  to  which  this 
common  language  will  affect  life  in  the  21st  century  may 
be  even  more  profound  than  the  effect  that  the  medium  of 
television  has  had  on  life  in  the  20ih  century.  Television 
has  provided  a  window  to  the  world  -  often  real-time  - 
for  m.Tny  of  the  5.4  billion  inhabitants  of  this  pbnet.  This 
medium  of  cultural  and  information  exchange  has  en¬ 
abled  previously  isolated  populations  to  join  an  emerging 
global  village  -  one  increasingly  free  of  barriers.  The 
common  digital  language  offers  a  unique  opportunity  to 
leverage  converging  technologies,  such  as  television, 
computers  and  telecommunications,  into  a  global  com¬ 
munications  network.  Such  a  network  would  have  the 
potential  to  offer  a  vastly  augmented  range  of  services  to 
all  system  users,  thus  opening  up  new  markets  to  all  of 
the  affected  equipment  and  service  providers. 

Worldwide,  there  is  a  growing  consensus  that  the  time 
has  come  to  develop  standards  for  television  systems 
based  on  a  new  paradigm  -  appropriate  for  today  -  with 
forethought  to  future  requirements.  The  introduction  of 
digital  technology  into  imaging  indusuies.  togcincr  with 
the  widespread  introduction  of  digital  communications, 
creates  a  window  of  opportunity  to  esublish  a  digital 
image  architecture  with  unprecedented  freedom  of  appli¬ 
cation  and  interconnection. 

This  Report  examines  some  of  the  fundamcni.aJ  issues 
that  must  be  addressed  in  achieving  a  compatible  set  of 
standards  enabling  a  glob.ally  interconnected  and  inter¬ 
operable  visual  communications  network.  The  essenibl 
concepts  for  this  family  of  standards  include:  an  open 
(non-proprietary)  system  architecture,  interoperability, 
scalability,  and  extensibility.  It  is  hoped  that  this  Report 
will  stimubte  the  interest  of  many  groups  and  organiza¬ 
tions  involved  in  the  establishment  of  imaging  standards, 
today  and  in  the  future,  and  lead  to  agreement  on  a  single 
s>'siem,  nexiblc  enough  to  accommodate  a  wide  variety 
of  needs,  while  enabling  worldwide  interoperability. 

This  Report  was  prepared  by  the  SMPTE  Task  Force  on 
Digital  Image  Architecture  and  is  responsive  to  the  Work 
Assignment,  dated  April  IVW I .  which  e.stablished  the  fol¬ 
lowing  objective: 


To  develop  and  propose  a  suucture  for  a  hierarchy  of 
digital  sundards  to  faciliute  interoperation  of  high 
resolution  display  systems.  Tlte  resulting  system  of 
sundards  will  be  open  (non-proprietary),  scalable  to 
various  performance  levels,  extensible  to  new  tech¬ 
nologies  and  will  be  based  on  the  documenu  submit¬ 
ted  by  the  USA  to  the  October.  1990.  meeting  ofCCIR 
IWP-11/9.  taking  account  also  of  the  requirements 
and  constraints  noted  in  the  joint  Advanced  Television 
Systems  Committee  (ATSO  /  Institute  of  Electrical 
and  Electronic  Engineers  -  Committee  on  Communi¬ 
cations  and  Information  Policy  (TF.EF.-CCIP)  meetiiif 
on  digital  system  information  exchange  of  Mareh 
12-13, 1991. 

The  Task  Force,  early  in  its  considerations,  identified  the 
need  to  expand  the  Work  Assignment  to  include  all  rele¬ 
vant  aspects  of  digital  imaging  systems  -  acquisition, 
processing,  storage,  transmission,  reconstruction  and  dis¬ 
play  -  and  to  consider  systems  xross  a  much  wider 
range  of  resolutions  than  previously  planned.  This  was 
agreed  to  and  is  rellected  in  this  Report.  Requirements 
and  constraints  noted  in  SMPT^yiEEE/ATSC  coqMn- 
sored  digital  system  information  exchange  meetinp 
have  also  been  incorporated  as  appropriate. 

The  Report  is.  in  essence,  the  outcome  of  a  feasibility 
study  concerning  the  creation  of  standards  for  digital 
im.’ige  systems  that  are  scabble  and  extensible,  effecting 
a  high  level  of  interoperability  between  a  diverse  range 
of  industries  and  applications.  The  work  is,  as  yet.  in¬ 
complete:  however,  it  has  already  esublished  an  import¬ 
ant  though  preliminary  basis  for  a  family  of  digital 
imaging  standards.  The  Report  raises  many  new  ques¬ 
tions  and  identifies  additional  work  required  to  refine  the 
concepts  that  form  the  basis  of  a  digital  image  architec¬ 
ture.  Of  panicular  imporunce  will  be  the  selection  of 
source  and  dispby  refresh  rates  to  provide  performance 
and  economic  compatibility  with  today's  television  sys¬ 
tems. 

The  concepts  outlined  can  provide  a  basis  for  a  modular 
open  system  architecture,  in  which  the  parameters  and 
characteristics  for  each  module,  and  the  interfxes  be¬ 
tween  these  modules,  arc  clearly  defined  and  in  the  putn 
tic  domain. 

Such  a  system  should  use  common  standard  components 
to  serve  diverse  needs  xross  all  affected  industries.  It 
should  enable  the  movement  of  image  data  xross  appli¬ 
cation  and  industry  boundaries  without  degradation  and 
with  minimum  complication.  This  is  interoperability. 

Such  a  SN'stem  should  also  provide  the  ability  to  adjust 
inuigc  parameters  •  icmponil  and  spatial  resolution,  col- 
orimeuv  and  dxiuinic  range  -  by  varying  the  amount  of 
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data  that  is  stored,  iransmiiicd.  received,  or  displayed. 
This  is  scalability. 

A  digital  image  architecture  must  give  forethought  to 
evolution  -  to  incorporate  advances  in  technology  within 
any  module,  without  changes  to  any  other  module.  It 
must  be  backward  compatible  with  today's  systems,  and 
forward  enabled  to  accommodate  the  technology  explo> 
sions  of  the  21st  century.  This  is  aiensibiliiy. 


This  Report  was  prepared  by  a  Task  Force  chaired  ini¬ 
tially  by  David  Trzeinski  (PictureTel)  and  latterly  by  Dr.  , 

Will  Stackhouse  (Jet  Propulsion  Laboratory),  with  a 
wide  participation  from  the  computer,  television,  post 
production  and  telecommunications  industries.  A  de¬ 
tailed  list  of  the  membership  follows.  The  Repon  was  [ 

considered  by  the  SMPTE  Standards  Committee  at  its 
meeting  of  August  13.  1992.  and  subsequently  adopted 
after  an  in-depth  review. 
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1  Executive  Summary 

The  SMPTE  Task  Force  on  Digiul  Image  Archiicciurc 
was  charged  with  developing  and  proposing  a  structure 
for  a  hierarchy  of  digital  image  standards  that  would 
fociliute  interoperation  of  image  systems.  The  major 
objective  was  to  establish  the  basis  for  image  systems 
that  are  open,  scalable  and  extensible,  thus  meeting  the 
perceived  needs  for  image  communicauons  in  the  envi* 
ronment  likely  to  exist  as  computers,  television  and  com* 
munications  converge,  enabled  by  pervasive  digital 
technology. 

The  Task  Force,  formed  from  representatives  of  the  af¬ 
fected  industries  and  applications,  has  examined  the  is¬ 
sues.  setting  out  those  that  are  believed  critical  at  this 
time,  and  has  modelled,  for  discussion,  further  reitne- 
ment  and  testing,  one  possible  approach  that  meets  the 
basic  requirements.  It  has  also  produced  extertsive  tuto¬ 
rial  information  concerning  the  matters  under  consider¬ 
ation. 

The  key  concepts  of  the  appratch  arc  defined  in  Section 
2.  setting  the  conditions  for  image  systems  that  arc: 

-  Open  -  the  modules  and  interfaces  forming  the  archi¬ 
tecture  are  fully  defined  and  in  the  public  domain 

•  Interoperable  -  images  and  related  equipment  may 
move  freely  across  application  and  industry  bound¬ 
aries 

Such  systems  would  be  based  on  a  hierarchy  that  is: 

•  Scalable  -  supports  a  wide  range  of  image  capabilities 

•  Extensible  -  future-proof  to  the  extent  possible 

•  Compatible  -  supports  existing  television  practices 
and  standards  when  possible 

Current  and  future  image  systems  are  presented  and  ana¬ 
lyzed  in  Section  3  of  the  Report,  which  also  states  the 
main  objectives  of  the  Task  Force  activity: 

•  To  establish  the  fundomcnutl  properties  of  image 
systems 

•  To  examine  the  technological  trends  with  a  view  to  a 
prediction  of  future  capabilities 

•  To  arrive  at  architectural  guidelines  that  will  achieve 
the  objectives  of  interoperability,  scalability  and  ex¬ 
tensibility 

Section  3  of  the  Report  also  establishes  the  fundttmcnial 
concepts  upon  which  a  model  for  an  opett  digital  image 
arehitccture  can  be  constructed,  taking  into  consideration 
the  objectives  defined  above. 


Section  4  deuils  the  critical  issues  in  the  development  of 

a  suitable  image  architecture  meeting  the  stat^  objec¬ 
tives: 

•  The  establishment  of  scalable  and  interoperable  hier¬ 
archies  for  basic  image  parameters 

•  The  selection  of  a  family  of  image  xquisition  retes 
and  related  display  refresh  rates  based  on  a  progres¬ 
sion  that  permits  display  refresh  at  integer  multiples 
of  the  acquisition  rate.  Backward  compatibility,  if 
required,  to  the  image  acquisition  rates  currently  in 
use  (24. 50. 59.94  and  60)  should  be  accommodated 
in  the  design  of  the  standard  modules  which  will 
interface  these  existing  systems  with  the  digital  image 
architecture. 

•  Use  of  a  square  sampling  grid  as  a  simple  and  effec¬ 
tive  common  expression  of  images 

•  Selection  of  analysis  and  coding  schemes  for  color 
and  luminance  that  would  allow  useful  and  effective 
scaling  of  this  image  data  while  maintaining  a  high 
level  of  interoperability 

•  Coherent  sampling  of  the  image  based  on  the  use  of 
progressive  scanning  techniques 

•  Use  of  headers/descriptors  to  identify  the  content  and 
conform  to  the  characteristics  of  the  data  stream 

•  The  esublishment  of  appropriate  levels  of  compati¬ 
bility  with  current  television  and  motion  picture  stan¬ 
dards 

It  is  believed  that  this  approach  will  result  in  systems  that 
achieve  a  good  level  of  compatibility  with  current  televi¬ 
sion  and  imaging  systems,  while  placing  a  minimum  of 
constraints  on  the  path  to  the  future  (extensibility). 

A  model  of  an  open  architecture  approach  to  image  stan¬ 
dards  is  developed  in  Section  5.  one  that  is  both  compat¬ 
ible  with  the  present  and  extensible  to  the  future.  It  is 
based  on  a  low  order  hierarchical  approach,  using  image 
tiles.  The  model  defines  four  levels  of  resolution  and 
takes  account  of  a  number  of  possible  aspect  ratios  cur¬ 
rently  in  use.  Additional  analysis  is  provided  regarding 
the  selection  of  an  appropriate  family  of  image  acquisi¬ 
tion  rates  and  display  refresh  rates.  Finally,  a  scalable 
coding  approach  is  proposed  that  offers  the  ability  to 
produce  image  dam  in  packages  that  can  be  combined  to 
produce  images  at  a  variety  of  spatial  and  tempoml  res> 
lution.s. 

The  Ta.sk  Force  Report  is  expected  to  be  of  interest  across 
a  wide  range  of  indu.strics  and  applications.  Section  6 
examines  the  industries  likely  to  Ik  most  affected,  their 
specific  itntigiiig  needs  and  the  pos.sible  impticts  of  a 
defined  digital  image  architecture. 
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In  Secuon  7  ihe  Task  Force  suggests  additional  work  that 
must  be  completed  to  move  towards  a  full  impicmenta* 
tion  of  a  digital  image  architecture.  The  list  of  sugges* 
tions  included  in  Section  7  is  not  exhaustive:  it  is 
recognized  that  in  the  process  of  validating  the  architcc- 
tunl  concepts,  additional  areas  for  further  analysis  will 
be  identiried.  An  extensive  list  of  questions  is  included 
which  should  be  considered  in  the  process  of  csublishing 
standards  for  an  architecture. 

Ihe  suggestions  include  the  following  items  of  high  pri* 
oriiy: 

•  Development  of  strawman  applications  in  a  few  crit¬ 
ical  industries  with  a  view  to  validating  the  concepts 
of  on  architecture 

•  Detailed  analysis  of  potential  breakthroughs  in  image 
technologies  to  establish  a  solid  basis  Cor  extensibility 


•  Analysis,  simulation  and  optimization  of  proposals 
concerning  the  selection  of  a  family  of  image  xqui* 
siiion  Qtes  and  rebted  display  refresh  rates 

•  Derailed  analysis  of  the  possible  routes  to  colorimetry 
and  dynamic  range  expressions  that  allow  scalability 
and  interoperability.  This  work  wUl  require  simub- 
tion  and  subjective  evaluations  to  validate  the  results. 

A  considerable  amount  of  background  and  tutorial  mate¬ 
rial  was  developed  during  the  preparation  of  the  Repon. 
Some  of  it  is  believed  to  be  of  value  generally  or  for 
reference  in  future  work  on  the  development  of  the  digital 
image  architecture.  This  material  is  included  in  Section  8: 

•  A  glossary  of  terms 

•  An  analysis  ofsome  of  the  issues  surrounding  tempo¬ 
ral  rate  conversions 


ft 
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2  Key  Concepts 

2.1  Introduclion 

As  a  stoning  point  in  the  process  of  developing  and 
communicating  the  requirements  for  a  digital  image  ar> 
chiiecture.  it  is  important  to  csublish  a  clear  definition  of 
the  key  concepts  upon  which  the  architecture  is  to  be 
based.  In  many  cases,  existing  dennitions  must  be  en¬ 
hanced  to  bridge  the  g.np  between  current  practice  and 
future  requirements  embodied  in  the  architecture. 

Two  reference  documents  were  utilized  in  the  process  of 
■  creating  the  definitions  which  follow: 

•  The  IEEE  Dictionary.  ANSI/IEEE  Std  100-1988 

•  The  Final  Report  of  the  SMPTE  Task  Force  on  Hcad- 
ers/Descriptors.  January  3, 1992 

Definitions  obtained  from  the  IEEE  Dictionary  are  pre¬ 
sented  in  “quotations"  -  they  provide  a  reference  point 
for  the  expanded  definitions  developed  by  the  Task 
Force.  Definitions  presented  in  the  Repon  of  the  SlvfPTE 
Task  Force  on  Headers/Descriptors  proved  to  be  incom¬ 
plete  for  the  needs  of  this  Report,  due  to  the  expanded 
Work  Assignment  for  the  Task  Force  on  Digital  Image 
Architecture.  While  the  definitions  in  this  Report  are 
consistent  with  the  earlier  work  of  the  Task  Force  on 
Headers/Descriptors,  they  provide  an  expanded  under¬ 
standing  of  the  key  concepts  for  a  digital  image  architec¬ 
ture. 


2.2  Digital  Image  Architecture 

A  system  architecture  defines  “the  structure  and  rebtion- 
ship  among  the  components  of  a  system." 

One  of  the  m.njor  objectives  of  this  Report  is  to  define  a 
system  architecture  which  promotes  sharing  of  images 
and  equipment  across  applications  and  industry  bound¬ 
aries.  To  achieve  this  goal,  the  digital  image  architecture 
must  be  highly  flexible  to  deal  with  a  variety  of  diverse 
requirements,  including  the  evolution  of  technology. 

A  digiul  image  architecture  should  be  an  open  system, 
that  is.  one  made  up  of  functional  modules  with  standard, 
public  interfaces  which  can  be  assembled  into  a  func¬ 
tional  system  -  “a  set  of  interconnected  elements  coasti- 
tuted  to  achieve  a  given  objective  by  performing 
specified  functions."  Explicit  objectives  of  the  architec¬ 
ture  include: 

•  Backward  comp.atibility  with  existing  systems 

•  Ability  to  incorporate  technological  improvement 
and  evolution  over  time 

•  Risk  reduction  (mitigating  the  impact  of  incorrect 
predictions).  Individual  issues,  such  as  those  dis- 
eussed  in  this  Report,  may  be  cncapsuiaied  into  dis¬ 


crete  building  blocks  which  can  be  changed  without 
affecting  the  rest  of  the  system. 

The  digiul  image  architecture  supports  both  natural  and 
synthetic  inugery  including: 

•  Text 

•  Graphics 

•  Animation 

•  Still  images 

•  Motion  image  sequences 

•  Audio 

•  Related  image  dau 

A  key  feature  of  the  architecture  is  that  it  allows  decou¬ 
pling  of  the  system  into  functionai  modules.  The  func¬ 
tional  modules  of  the  architecture  are; 

•  Acquisition  or  generation  -  including  processing 
(transformation,  enhancement,  composition,  inter- 
preution)  and  storage 

•  Transmission  -  including  compression  and  storage 

•  Dispby  -  reconstructing  the  image  for  the  human 
visual  system,  including  hard  copy  print 

2.3  Interoperability 

Interoperability  is  the  sharing  of  images  and  equipment 
across  application  and  indusuy  boundaries.  When  deal¬ 
ing  wiih  digital  image  represenutions.  this  sharing 
should  be  facilitated  without  degrading  inuge  quality 
due  to  transformations  in  temporal  and  spatul  resolution, 
grid  geometry,  and  image  aspect  ratio. 

This  requires  careful  attention  to  the  definition  of  the 
interfaces  -  the  shared  boundaries  -  between  the  func¬ 
tional  modules. 

The  key  interface  definitions  are: 

•  A  data  definition  between  the  acquisition  and  display 
modules 

•  A  service  definition  between  acquisition  and  dispby 
modules  to  the  uansmission  module 

2.4  Hierarchy 

A  hicnurhicul  digital  image  architecture  is  one  in  which 
various  levels  of  performance  are  supponed: 

•  A  variety  of  spatial  resolutions,  aspect  ratios,  and 
ictnporai  rates 
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•  A  variety  of  colorimetry  and  dynamic  ranjjc  specifi¬ 
cations 

•  A  variety  of  image  encoding  standards. 

The  architecture  is  hierarchical  in  order  to  address  the 
requirements  for  scalability  and  cxicasibility. 

2.4.1  Scalability 

To  scale  is:  To  change  the  quantity  by  a  factor  in  order 
to  bring  its  range  within  prescribed  limits." 

Scalability  deals  with  the  ability  of  an  imaging  system  to 
adjust  the  level  of  performance  by  varying  the  amount  of 
data  that  is  stored,  transmitted,  received,  or  displayed  - 
up  to  the  maximum  resolution  that  was  originally  ac¬ 
quired.  A  number  of  specific  definitions  arc  implied: 

•  Transmit  Scalability  -  the  ability  to  encode  a  visual 
sequence  at  various  spatial  and/or  temporal  resolu¬ 
tions  to  conform  to  specific  transmission  require¬ 
ments.  These  requirements  may  include  support  for 
both  fixed  and  variable  bandwidth  channels,  and  spe- 

.  cific  transmission  standards.  Transmit  scalability  in- 
.  eludes  scalable  compression  ^  the  ability  to  encode  a 
visual  sequence  so  as  to  enable  the  decoding  of  the 
digital  data  stream  at  various  spatial  and/or  temporal 
..  .  resolutions.  Scalable  compression  techniques  typi¬ 
cally  filter  the  image  into  separate  bands  of  spatial 
and/or  temporal  data.  Appropriate  data  reduction 
techniques  are  then  applied  to  each  band  to  match  the 
response  characteristics  of  the  hum.on  visual  system. 


the  capacity  of  the  uansmission  subsystem  and  the 
economics  of  the  display  subsystem. 

•  Storage  Scalability  -  the  ability  to  employ  storage 
devices  of  different  C3p.acities  and  aceess  speeds 

•  Receiver  Scal.abiliiy  -  the  ability  of  a  decoder  to 
extract,  from  a  single  daD  stream,  only  that  data 
required  for  displaying  a  portion  of  the  image  and/or 
a  reduced  resolution  image,  either  spatially  or  tempo¬ 
rally 

•  Dispby  Scatability  -  the  ability  of  a  display  to  con¬ 
form  to  visual  sequences  of  varying  spatial  and/or 
temporal  resolution.  With  scanning  display  systems 
this  may  be  accomplished  by  changing  the  synchrev 
nization  rates  for  the  scanning  process,  or  by  creating 
a  window  that  utilizes  an  appropriate  suteet  of  the 
available  pixels. 

2.4.2  Extensibility 

Extensibility  in  the  design  of  a  hierarchical  digital  image 
architecture  allows  the  system  to  evolve  with  advances  in 
the  underlying  technologies  so  that  additional  levels  of 
performance  can  be  implemented,  without  rendering  ob¬ 
solete  those  existing  products  that  conform  to  the  basic 
requirements  of  the  imaging  hierarchy. 

Extensibility  implies  designing  evolution  into  the  sys¬ 
tem.  The  transmission  and  display  modules  of  the  system 
should  be  cast  as  building  blocks.  The  building  blocks, 
because  of  their  inherent  modularity,  may  freely  evolve 
over  time. 
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3  Analysis  of  ImaHing  Architectures 


3.1  Establishing  a  Framework  for  Analysis 

For  more  than  two  decades,  the  application  of  disital 
’  processing  techniques  ha.s  contributed  to  the  evolution  of 
analog  composite  television  systems,  especially  in  the 
areas  of  video  recording,  image  processing,  and  image 
synthesis.  This  evolutionary  use  of  digital  technology 
had  little  effect  on  the  perception  of  imaging  systems: 
from  this  perspective  many  observers  believed  th^  digi* 
tai  video  would  gradually  replace  analog  video  without 
any  fundamental  changes  to  the  foundation  of  imaging 
systems. 

However,  in  the  past  few  years  this  evolutionary  view  of 
imaging  systems  has  been  challenged.  At  the  26th  An« 
nual  SMPTE  Advanced  Television  and  Electronic  Imag¬ 
ing  Conference.  John  Watkinson  suggested  that  we 
analyze  the  impact  of  digital  technologies  from  another 
perspective:  "To  think  that  digital  technology  only  im¬ 
pacts  the  underlying  equipment  and  that  otherwise  it's 
business  as  usual  is  to  miss  the  larger  mnsformation  that 
is  occurring  in  each  of  the  affected  induscics." 

From  Watkinson's  perspective,  the  transition  to  a  new 
digital  imaging  architecture  represents  the  opponunity 
for  a  new  paradigm.  Proponents  of  this  position  have 
encouraged  system  designers  to  step  hack  and  ukc  a 
global  view  of  the  impact  that  digital  technologies  are 
'  having  on  every  industry  that  deals  with  electronic  imag¬ 
ing;  to  think  not  just  in  terms  of  delivering  ever-improv¬ 
ing  levels  of  image  quality,  but  of  w-hat  being  digital 
really  means. 

*  John  .Naisbett  in  his  1982  best  seller  Megtiirentis:  Tai 
Diri'i  liens  fur  Transforming  Our  Lives.  Stated  that  new 
technologies  go  through  three  phtises  as  they  become 
p.^!  of  our  daily  lives.  Applying  Naisbett 's  model  to  the 
evolution  of  electfonic  imaging  systems  leads  to  the  fol¬ 
lowing  three  paradigms: 

<  Paradigm  1  •>  the  technology  is  threatening,  so  it 
enters  society  in  a  non-threatening  way:  i.c..  enter¬ 
tainment  in  the  form  of  broadcast  television 

•  Paradigm  2  -  the  technology  is  c.xtcndcd  to  improve 
products  that  we  already  use  or  to  simplify  the  work 
that  we  do:  i.c..  professional  video,  camcorders  and 
the  home  VCR 

•  Paradigm  3  -  the  technology  enables  entirely  new 
applications:  i.e..  the  integration  of  video  with  other 
media  and  panicipation  by  the  cotesumer 

From  the  new  perspective,  being  digital  deals  with  the 
shift  to  the  third  par.idigm.  It  is  the  enabling  tcchiuilogy 
tiuit  has  rtuidc  it  possible  lor  this  T;tsk  Foax  to  aiuiiyze 
the  requiremenus  for  iiiicroperaliility.  scalability  and  ex¬ 
tensibility.  and  to  propose  a  set  of  gutdclincs  to  accom¬ 


plish  these  goals.  What  are  the  aspects  of  being  digital 
that  .have  brought  about  this  transformation  in  perspec¬ 
tives? 

A  major  factor  has  been  the  geometric  progression  in 
computer  processing  capabilities  •  doubling  computa- 
tioni  power  every  two  years,  with  h'ttle  change  in  cost  or 
size.  'Htis  progression  is  projected  to  continue  well  into 
the  next  century.  As  a  result,  high-resolution  still-image 
processing  capabilities  are  now  within  reach  of  every 
computer  user.  Techniques  once  reserved  for  high-end 
workstations  are  now  commonly  applied  in  desktop  com¬ 
puting,  including  the  recent  addition  of  full  motion  video 
as  a  dau  type. 

Video  has  also  been  a  major  beneficiary  of  the  technol¬ 
ogy  progression.  Production  systems  that  only  a  decade 
ago  required  a  six  foot  rack  of  electronics  can  now  be 
implemented  in  a  few  rack  units  -  or  on  a  few  cards  that 
plug  into  a  personal  computer. 

The  tremendous  increase  in  compuutional  power  has 
enabled  another  critical  aspect  of  being  digiDl  -  video 
encoding  based  on  the  use  of  digital  compression  tech¬ 
niques  to  reduce  the  required  dau  rate.  A  variety  of 
compression  technologies  have  evolved  that  remove 
image  redundancy  within  and  between  video  frames.  The 
required  data  rate  may  also  be  significantly  reduced  by 
more  efficient  coding  of  the  image  at  the  source.  Devel¬ 
opments  of  such  techniques  are  progressing  rapidly  and 
may  become  useful  in  the  near  future. 

While  compression  technology  has  existed  for  many 
years,  and  continues  to  evolve,  practical  im¬ 
plementations  for  video  have  only  become  possible  in  the 
p:ist  few  years  due  to  the  rapid  evolution  of  digital  process- 
ing  technologies.  This  in  turn  has  stimulated  new  research 
into  scalable  video  encoding  techniques  that  will  allow 
multiple  levels  of  image  quality  to  be  extraacd  from  a 
single  image  dau  stream.  Some  observers  predict  that  the 
processing  power  required  for  the  decoding  of  scalable 
digital  video  streams  will  be  universal  and  inexpensive 
before  the  end  of  this  decade. 

Improvements  in  data  compression  perform  the  same 
function  as  increases  in  bit  carrying  capacity  in  the  com¬ 
munications  system  -  delivery  of  more  bits  to  the  user.  In 
the  past  decade,  increases  in  communications  capacity  of 
sCNxral  orders  of  magnitude  have  occurred. 

In  such  an  cnvinminent.  the  kmgevity  of  new  equipment 
puroliascs  may  be  dependent  upon  a  digital  image  archi¬ 
tecture  that  is  designed  with  adequtite  provisions  fur  cx- 
toisibility.  To  meet  tills  objective  the  Task  Force  has 
fucu-sed  its  attention  wi  tlirec  areas: 
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•  FundamcnDl  propenics  of  human  visual  perception 
and  their  effect  on  the  design  of  imaging  systems 
(Section  32) 

•  Technology  trends  that  can  be  predicted  with  reason¬ 
able  certainty  (Section  B.-l) 

•  And  (most  imponant)  architectural  guidelines  that  if 
held  constant  will  promote  interoperability,  scalabil¬ 
ity  and  extensibility  (Section  4) 

3.2  Properties  of  Human  Visual  Perception 

The  human  visual  system  deals  with  the  physical  world 
both  in  terms  of  its  ability  to  resolve  image  deuil  (spatial 
resolution),  and  changes  in  the  environment  (temporal 
resolution).  We  experience  the  world  visually  by  aptur- 
ing  light  directly  from  a  source,  or  as  the  reflections  of 
light  off  of  objects  in  our  physical  environment.  The 
resulting  perceptions  of  the  environment  are  typically 
described  in  terms  of  size,  shape,  brightness,  color, 
depth,  direction,  and  speed.  These  qualities  arise  in  the 
brain's  image  processing  circuiu^:  essentially  they  result 
from  a  comparison  of  the  acquired  visual  cues  with  what 
we  have  learned  about  the  world's  intrinsic  suucture. 

As  research  has  revealed  more  about  the  physiology  of 
vision,  prevailing  theory  has  evolved,  pbcing  major  em¬ 
phasis  on  the  computational  and  cognitive  role  played  by 
the  brain  and  local  image  receptors.  In  turn,  this  research 
is  providing  potentially  valuable  input  to  the  designers  of 
digiul  imaging  systems. 

3.2.1  Human  Image  Acquisition 

'The  human  visual  system  relies  on  multiple  image  recep¬ 
tors  to  deal  with  the  diversity  of  environments  that  it 
encounters:  cones  are  utilized  for  color  image  acquisition 
over  a  wide  range  of  illumination  levels:  rods  are  utilized 
for  monochrome  image  acquisition  over  the  lower  range 
of  illumination  levels. 

The  eye  contains  approximately  two  million  cones  and 
120  million  rods.  The  cones  are  organized  into  three 
broad  groups  of  reeeptors  that  are  sensitive  to  light  in 
specific  spectral  bands:  while  these  bands  have  signif¬ 
icant  overlaps,  they  roughly  conform  to  the  red.  green, 
and  blue  ponions  of  the  spectrum.  Red  and  green  re¬ 
ceptors  each  outnumber  blue  receptors  by  a  factor  of 
two  to  one.  *rhe  dispersion  of  these  receptors  is  not 
uniform,  thus  spatial  perception  deals  with  a  complex 
matrix  of  receptor  types  and  cognitive  processing  by 
the  brain. 

The  ccnicr  of  the  visual  field,  an  area  called  the  fovea, 
contains  .^0.000  to  40.(X)0  cones  and  no  rods.  Outside 
the  fovea  the  density  of  cones  diminishes,  interspersed 
among  the  high  density  rods.  The  cones  within  the 
fovea  arc  responsible  for  high  spatial  detail  perception 
while  the  extrafoveal  cones  and  rods  play  an  important 


role  in  visual  search  and  influence  directed  eye  move¬ 
ment.  Central  vision  enables  us  to  see  detail,  while  pe¬ 
ripheral  vision  is  attuned  tochange. 

Although  high  spatial  resolution  vision  is  restricted  to  the 
fovea,  the  visual  system  acquires  high  resolution  images 
over  a  wide  portion  of  the  Ttcld  of  view.  This  is  achieved 
through  involuntary  eye  movements:  high  frequency 
tremor,  slow  drift,  and  rapid  saccade. 

Research  has  detemined  that  it  takes  several  hundred 
milliseconds  for  the  eye  to  acquire  a  high  spatial  resolu¬ 
tion  image,  synthesized  from  a  number  of  overbpping 
views.  Slow  drift  and  rapid  saccade  ars  the  mechanisms 
used  for  repositioning  the  fovea  to  xquire  these  multiple 
impressions.  The  tremor  appears  to  be  a  mechanism  to 
remove  high  frequency  spatial  noise.  ‘The  tremor's  oscil¬ 
lation  occurs  at  a  frequency  range  of  40  to  80  Hz  over  an 
area  approximately  equal  to  the  size  of  a  single  cone. 

Since  human  vision  is  binocular,  involuntary  eye  move¬ 
ments  also  contribute  to  depth  perception:  the  brain  pro¬ 
cesses  these  overlapping  views  to  obtain  differences 
from  which  depth  and  spatial  properties  are  inferred. 

The  spatbl  resolution  of  moving  objects  is  also  linked  to 
eye  movement’ 

•  If  the  eye  does  not  track  a  moving  object  the  object 
will  move  xross  multiple  sensors  creating  an  image 
with  low  spatial  resolution.  In  this  case  resolution 
depends  on  the  rate  of  movement:  rtqiid  motion  will 
create  a  blurred  response. 

•  If  the  eye  uacks  the  moving  object,  receptor  stimub- 
tion  is  aligned  with  the  object  and  additional  samples 
are  collxtcd  over  time.  Thus  the  image  will  be  of 
higher  spatbl  resolution. 

3.2.2  Human  Visual  Processing 

Much  of  the  research  in  visual  science  today  is  fxused 
on  the  processing  of  data  acquired  by  the  intage  recep¬ 
tors.  A  variety  of  speeblized  analyzers  in  the  eye  process 
data  from  small  localized  regions  and  xcumuiate  the 
results  bto  channels  which  are  processed  by  the  brain  to 
create  an  integrated  view  of  the  physical  environment. 

There  is  evidence  that  the  brain  directs  the  xtivity  of  the 
image  receptors  for  processes  sxh  x  establishing  white 
balance  and  light  scn.sitivity  levels.  Simple  localized  an¬ 
alyzers  are  used  to  enhance  the  data  iiansmincd  bxk  to  the 
brain.  Some  of  these  analyzers  are  sensitive  to  a  particular 
edge  orienQiion:  there  are  sulTicient  analyzers  at  exh  Ixa- 
tion  to  represent  a  full  set  of  edge  orbnutions.  Additional 
tuned  antilyzcrs  cover  ponions  of  the  range  of  human  sensi¬ 
tivity  fur  spatitil  frequency,  sptttbl  position,  temporal  fre¬ 
quency.  direction  of  motion,  and  binocular  disparity. 

The  data  processed  by  these  analyzen  mo\*cs  to  the  brain 
through  two  types  of  channels:  a  set  of  fast  responding 
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channels  with  relaiivcly  transient  responses  to  stimuli, 
and  a  set  of  slower  clunncis  with  relatively  susuined 
responses  to  stimuli.  Transient  channels  process  the  out¬ 
put  of  analyzers  that  are  tuned  for  low  spatial  and  high 
temporal  frequency  stimuli.  Sustained  channels  process 
the  output  of  analyzers  that  arc  tuned  for  high  spatial  and 
low  temporal  frequency  stimuli. 

3.2.3  Thresholds  for  the  Perception  of  Flicker 

Above  certain  frequencies,  nickering  light  sources  will 
appear  os  a  continuous  light  source.  The  relevant  fre¬ 
quency  is  colled  the  critical  fusion  frequency  and  varies 
with  the  level  of  illumination.  Separate  nicker  thresholds 
exist  for  the  transient  and  sustained  processing  channels. 

Transient  channels  are  sensitive  to  flickering  light 
sources  with  low  spatial  resolution;  this  type  of  stimula¬ 
tion  appears  as  wide-area  flicker  and  is  most  noticeable 
in  peripheral  vision.  At  low  levels  of  illumin.'iiion  (where 
rod  vision  is  used)  flicker  fusion  occurs  at  frequencies  of 
only  a  few  Hz:  as  the  level  of  illumination  increases  and 
cone  vision  is  triggered  the  fusion  frequency  increases. 

Flicker  from  low  light  level  sources  such  as  a  television 
or  movie  screen  typically  dis.*ippears  in  the  range  of  20  to 
60  Hz.  As  screen  size  increases,  taking  up  a  larger  portion 
of  the  Held  of  vision,  or  if  screen  brightness  increases,  the 
frequency  for  flicker  fusion  increases. 

Sustained  channels  are  sensitive  to  flickering  light 
sources  with  high  spatial  resolution:  this  type  of  stimula¬ 
tion  appears  os  small  .area-nicker,  often  associ.ated  with 
moving  objects.  In  this  case  the  flicker  fusion  frequency 
cat)  be  much  higher  th.^)  for  wide-area  nicker,  this  form 
of  flicker  manifests  itself  as  strobing  of  the  object. 

An  excellent  example  is  found  in  the  single  pixel  hori¬ 
zontal  lines  often  used  in  computer  graphics.  These  lines 
do  not  appear  to  nicker  on  a  progressive  scan  computer 
display  which  is  refreshed  at  rates  above  60  Hz:  but  if  the 
same  image  is  presented  on  an  interlaced  video  display 
the  single  pixel  lines  are  presented  in  every  other  field  (at 
30  Hz)' and  they  flicker.  Tltis  is  due  to  the  fact  that  the 
persistence  of  the  display  phosphor  is  of  shorter  duration 
than  the  rcfrcsit  rate:  higher  scanning  rates  (cither  pro¬ 
gressive  or  interlaced)  ciiminaic  the  flicker. 

3.2.4  Tuning  Electronic  Imaging  Systems  to 
Match  Human  Visual  Perception 

Our  improved  understanding  of  human  visual  percep¬ 
tion  together  with  an  exponential  improvement  in 
electronic  image  processing  techniques  has  set  the 
st.*)gc  for  the  design  of  a  new  digital  image  architec¬ 
ture. 

In  order  for  a  new  digital  image  architecture  to  he 
interoperable  it  must  deal  with  existing  imaging  tech¬ 
nologies.  This  requirement  can  pl.ncc  many  constraints 


on  the  design  of  the  architecture.  It  isimportant  tounder- 
siand  the  reasons  that  these  consu^ints  exist  todetermine 
if  the  new  architecture  must  be  similarly  consuained. 

3.2.5  The  Elimination  of  Flicker  on  Scanning 
Displays 

Information  is  presented  in  Section  4^  which  suggests 
that  the  refresh  rate  for  scanning  GIT  displays  should  be 
linked  to  the  field  of  view  and  brightness  of  the  dispby. 
Lower  refresh  rates  are  acceptable  when  the  display  cov¬ 
ers  a  narrow  field  of  view,  as  is  the  case  with  our  existing 
analog  composite  video  delivery  systems.  Lower  lefitesh 
rates  are  also  acceptable  for  a  display  with  a  wide  field  of 
view  at  low  brightness  levels:  typically  this  type  of  dis¬ 
pby  requires  a  viewing  environment  with  low  ambient 
light  levels  such  as  a  theater. 

As  the  dispby  covers  a  wider  field  of  view  at  higher 
levels  of  brightness,  the  refresh  rate  must  be  increased  to 
eliminate  wide-area  flicker.  If  informau'on  with  high  fre¬ 
quency  edges,  such  as  computer  generated  text  and 
graphics,  is  presented  on  the  display  it  must  also  be 
refreshed  at  a  higher  rate.  The  computer  industry  uses 
progressive  scanning  with  refresh  frequencies  above  60 
Hz  to  eliminate  flicker,  larger  displays  (2  16  inches  dbg- 
onal)  are  typically  refreshed  at  72  or  75  Hz. 

The  some  requirements  for  the  elimination  of  wide-area 
flicker  are  now  stoning  to  influence  the  development  of 
display  systems  for  home  entertainment.  At  the  higher 
end  of  the  home  enteruinmem  market  it  would  be  desir¬ 
able  for  dispbys  to  provide  a  50  degree  field  of  view,  and 
be  viewable  at  normal  room  ambient  light  levels.  Such  a 
display  has  resolution  and  refresh  requirements  nearly 
identical  to  a  large  personal  computer  ^'pby. 

3.2.6  Constraints  that  Dictated  the  Use  of  Inter¬ 
lace  Scanning  Technologies  in  Analog  Compos¬ 
ite  Video  Systems 

Several  factors  influenced  the  decision  to  use  interlace 
scanning  techniques  for  acquisition  and  dispby  when  our 
composite  video  systems  were  designed: 

•  Bandw’idth  a%’anable  for  signal  processing  and  trans¬ 
mission 

•  The  elimination  of  wide-area  flicker 

Both  interbeed  and  progressive  scanning  were  evalu¬ 
ated:  interbee  proved  to  be  the  best  solution  to  reduce 
signal  bandwidth  and  minimize  flicker  in  the  dispby. 

3.3  Models  for  the  Design  of  an  Imaging 
Architecture 

As  the  design  of  a  new  digital  imaging  architecture  is 
appraached.  it  is  important  to  take  into  account  of  all  the 
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applications  and  industries  that  may  utilize  the  architcc* 
ture.  as  well  as  the  economic  contributions  of  each  in  the 
development  and  purchase  of  the  system  components 
(see  Section  6).  Experience  with  analog  television  has 
amply  demonstrated  the  value  of  “economies  of  scale." 
The  opportunity  now  exists  to  design  an  open  digital 
imaging  architecture  that  is  based  on  generic,  inexpen¬ 
sive,  and  increasingly  powerful  processing  elements. 

The  choice  of  a  digital  image  architecture  has  im¬ 
plications  that  reach  (ar  beyond  the  normal  realm  of 
standards-setiing  activities.  Telecommunications,  televi¬ 
sion.  and  computing  have  made  major  impacts  on  life  in 
the  20th  century  » their  integration  is  likely  to  have  a 
profound  affect  on  the  way  that  the  world  communicates, 
is  educated,  works,  plays  and  relaxes  in  the  next  century. 

3.3.1  Components  of  the  Model  -  Resolution 

The  level  of  resolution  perceived  by  a  viewer  is  a  func¬ 
tion  of  the  distance  of  the  viewer  from  the  display.  Thus, 
to  design  a  digital  image  architecture  that  provides  con¬ 
stant  perceived  resolution  across  applications  that  in¬ 
volve  different  viewing  distances  (e.g..  close  for  a 
computer  dispby.  further  away  for  a  conventional  TV 
screen,  still  funher  for  a  large  flat-panel  display),  the 
system  must  be  scabble  in  terms  of  image  resolution. 

In  addition  to  holding  perceived  resolution  constant 
under  varying  viewing  distances,  it  is  considered  desir¬ 
able  to  provide  even  greater  resolution  in  some  applica¬ 
tions.  os  discussed  below  and  as  implemented  in  current 
proposals  for  advanced  television  systems. 

While  it  would  be  desirable  to  design  an  imaging  archi¬ 
tecture  in  which  resolution  could  be  scaled  in  a  continu¬ 
ous  fashion,  a  hierarchy  based  on  a  progression  of  related 
image  resolution  levels  can  provide  similar  benefits  to 
system  designen  and  simplify  the  process  of  interopera¬ 
tion.  Sections  52  and  5J  provide  a  detailed  analysis  of 
the  variables  that  affect  the  perceived  resolution  of  a 
display  and  illustrate  the  principles  of  a  hierarchical  dig¬ 
ital  image  architecture  with  a  progression  of  four  image 
resolution  levels. 

Throughout  this  Repon.  the  concept  of  a  multi-resolution 
hierarchy  will  be  discussed  and  renned.  The  Task  Force 
has  constrxicted  a  model  to  facilitate  this  discussion.  It  is 
recognized  that  many  different  sets  of  numbers  can  be 
used  within  this  model.  Four  levels  of  resolution  have 
been  identified  and  defined:  additional  levels  can  be 
added  to  the  progression,  as  enabling  technologies  allow 
support  for  higher  Icx'cls  of  resolution.  The  four  levels  in 
the  model  are: 

•  Low  Resolution  -  systems  designed  to  deliver  images 
over  a  close-  up.  narrow  Held  of  view,  particulurly 
when  con.s'tniincd  by  cost  iir  bit  rate,  nuiy  deliver 
imagery  at  less  than  average  hunuin  visual  acuity. 
Videoconferencing,  vidcuinail  and  personal  attcr- 


tainment  systems  are  good  examples  of  applications 
for  low  resolution  imaging. 

•  Normal  Resolution  -  systems  designed  to  deliver 
images  with  resolution  requirements  that  approxi¬ 
mate  average  human  visual  acuity.  This  level  of  res¬ 
olution  is  suitable  for  the  delivery  of  natural  imagery, 
as  is  the  case  in  most  entertainment  applications. 
Present  television  systems  are  in  this  range,  and  studio 
quality  analog  component  video  is  at  the  high  end  of 
the  range. 

•  HighResolution->systemsdesignedtodeiiverimages 
over  a  wide  field  of  view,  with  high  spatial  and/or  high 
temporal  frequencies,  typically  have  resolution  re¬ 
quirements  near  the  limits  of  human  visual  percept 
tion.  Applications  include  motion  picture  delivery  in 
theatres,  entertainment  and  information  presenutions 
to  large  audiences,  and  personal  computer  displays. 

•  Ultra  High  Resolution  -  systems  desi  gned  to  deliver 
images  over  the  entire  field  of  view  (or  beyond),  at 
the  spatial  and  temporal  limits  of  human  visual  per¬ 
ception.  pbee  the  most  demanding  requirements  on 
an  imaging  architecture.  Theatre  delivery  systems 
such  as  Showscans  and  IMAX®  are  in  this  category. 
Electronic  imaging  systems  for  simubtion  are  reach¬ 
ing  this  level  of  resolution  as  the  enabling  technolo¬ 
gies  make  delivery  at  this  level  feasible. 

The  concept  of  interoperability  first  appeared  in  the  early 
days  of  television  because  of  the  need  to  integrate  film 
material  into  television  program  content.  Unfortunately, 
film  and  video  were  in  some  respects  incompatible.  Eb^ 
orate  shuttering  mechanisms  were  developed  for  the  tele¬ 
cine  to  make  it  possible  to  dispby  film  in  the  world  of 
video:  thus  the  concept  of  interoperability  was  bom.  For 
525/60  the  compromise  was  the  use  of  3-2  pull  down,  to 
accommodate  the  change  from  24  to  30  fps  (frames  per 
second).  The  solution  for  625/50  was  easier  -  a  4%  speed 
change,  pbying  programs  acquired  at  24  fps  at  the  televi¬ 
sion  rate  of  25  fps. 

The  evolution  of  electronic  image  acquisition  systems 
has  been  driven  primarily  by  the  mass  market  tnnsmis- 
sion  sbndards  -  NTSC,  PAL  and  SECAM.  New  applica¬ 
tions  for  video  such  as  professional  and  personal  video 
systems  have  been  enabled  through  the  economies  of 
scale  associated  with  these  standards. 

Thus,  applications  which  required  higher  resolutions 
than  those  offered  by  NTSC.  PAL  and  SECAM  have 
either  been  forced  to  bear  the  expense  of  system  develop¬ 
ment  and  low  volume  manufacturing  -  a  luxury  primar¬ 
ily  rescr\'cd  for  the  military  -  or  to  wait  for  the  next 
imaging  standard  to  evolve.  It  is  interesting  to  note  that 
the  equipment  dcvelojicd  for  the  various  analog  HDTV 
systems  has  seen  extensive  use  in  professional  applica¬ 
tions  that  need  the  added  resolution  afforded  by  these 
systems. 
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3.3.2  Components  of  the  Model  -  Acquisition, 
Transmission  and  Display 

The  path  through  which  an  image  passes  from  capture  to 
display  may  involve  as  many  as  five  major  steps  os 
shown  in  Ftgute  3.1.  These  steps  are  discussed  in  detail 
in  Section  62J  of  this  Repon.  As  the  imagery  moves 
from  one  step  to  the  next,  it  may  be  stored  at  one  of 
several  quality  levels. 


3.  Transport 

The  imagery  may  be  encoded  and  stored  at  a  lower  level 
of  quality  to  conform  to  the  transpon  characteristics:  we 
refer  to  this  as  distribution  quality  storage. 

Finally,  the  imagery  must  be  decoded  for  display,  requir* 
ing 


The  first  two  steps  are 

1.  Capture  (by  an  image  sensor) 

2.  Prwessing 

and  typically  require  production  quality  storage  to  pre¬ 
serve  as  much  of  the  original  imagery  as  possible  for 
subsequent  processing.  After  the  processing  steps  have 
been  completed,  the  imagery  may  be  stored  at  a  lower  level 
of  quality  for  release  to  the  distributor  of  the  imagery:  this  is 
often  referred  to  as  contribution  quality  storage. 

Delivering  the  imagery  to  the  consumer  typically  in¬ 
volves  the  third  step. 


4.  Reconstruction 

5.  Display 

The  consumer  may  store  the  imagery  for  viewing  at 
another  time:  this  also  requires  distribution  quality  stor¬ 
age. 

Some  of  these  steps  tend  to  be  grouped  with  a  specific 
level  of  storage  quality,  as  illustrated  in  Figure  3.1.  This 
allows  a  funher  simplification  of  the  model  based  on 
three  major  system  components  -  ACQUISITION. 
TRANSMISSION.and  DISPLAY. 


Figure  3.1 
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3.3.3  A  Closed  Architecture  Model  -  Analog 
Composite  Video 

The  transmission  standards  for  the  existing  composite 
video  systems  frequently  require  all  of  these  components 
to  operate  in  close  synchronism.  The  display  is  synchro¬ 
nized  with  live  or  taped  program  material  that  feeds  the 
oansmission  system.  Imagery  acquired  at  other  spatial  or 
temporal  resolutions  requires  conversion  into  the  spatial 
and  temporal  specifications  of  the  uansmission  standard. 
Such  an  architecture  is  depicted  in  Figure  3.2. 

The  advent  of  video  recording  provided  a  degree  of  de¬ 
coupling  of  acquisition  from  the  other  components,  al¬ 
lowing  program  producers  to  create  program  content 
without  real-time  constraints:  however,  transmission  and 
display  remain  tightly  coupled.  Recording  media  for  pro¬ 
gram  content  have  typically  been  coupled  to  the  trans¬ 
mission  standard  to  take  ^vantage  of  the  bandwidth 
reduction  techniques  applied  in  the  system.  The  design  of 
consumer  VCRs  is  based  on  compatibility  with  the  trans¬ 
mission  standard:  packaged  media  played  by  the  VCR 
must  therefore  conform  to  the  same  standard. 

While  interoperability  between  the  various  analog  com¬ 
posite  video  systems  has  had  to  overcome  differences  in 
frame  and  line  rates,  these  systems  have  been  remarkably 
extensible.  The  acquisition,  transmission  and  display 


components  and  the  associated  services  of  the  system 
have  evolved  continuously  over  the  past  fifty  years. 

With  the  inuoduction  of  analog  component  video  record¬ 
ing  and  processing  systems  in  the  *80s  the  video  industry 
took  a  major  step  toward  completely  decoupling  acquisi¬ 
tion  from  transmission  and  display.  The  production  corn- 
munity  soon  discovered  the  advantages  of  this 
decoupling. 

By  using  analog  component  equipment  for  both  acqui¬ 
sition  and  production,  it  became  possible  to  edit  video 
without  concern  for  the  multi-field  color  framing  se¬ 
quences  that  exist  in  subcarrier  encoded  composite 
video  systems.  Producers  also  discovered  that  fewer 
artifacts  were  introduced  when  byering  video  using 
component  vision  mixers  and  digital  video  effect  sys¬ 
tems.  Decoupling  of  xquisition  and  production  equip¬ 
ment  from  the  encoded  transmission  standard 
produced  far  better  results  than  could  be  xhieved  with 
composite  video  xquisition  and  prodxtion  equipment 
-  and  the  same  video  recorders  also  produced  encoded 
outputs  for  transmission  of  the  program. 

3.3.4  An  Open  Architecture  Model  -  Digital 
Hierarchies 

In  the  '80s,  the  publishing  industry  experiexed  the  colli¬ 
sion  of  analog  xd  digital  txhnologies.  Today,  inter- 


Figure  3.2 
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operability  of  media  in  the  publishing  industry  is  the  rule 
rather  than  the  exception,  as  digital  image  and  document 
processing  techniques,  generally  categorized  under  the 
umbrclb  of  Desktop  Publishing,  have  repbeed  tradi¬ 
tional  analog  techniques. 

To  a  large  extent,  the  transition  from  the  analog  represen¬ 
tations  of  printed  media  -  type,  line  an.  halftones,  and 
color  separations  -  to  their  digital  counterpans,  has  been 
enabled  by  the  use  of  scalable  hierarehies  for  the  acquisi¬ 
tion.  transmission,  and  display  of  printed  materials.  The 
tools  for  acquisition  and  production  of  print  media  have 
been  separated  from  the  display  hierarchy,  allowing  out¬ 
put  at  the  desired  level  of  resolution. 

Electronic  transmission  is  also  beginning  to  play  a  major 
role  in  the  publishing  of  documents.  Compact  represen¬ 
tations  of  printed  media,  using  page  description  bn- 
guages.  have  allowed  high  quality  print  represenDUons 
to  be  moved  efficiently  through  the  telecommunications 
network  using  low  data  rate  modems.  Remote  printing  of 
documents  on  fax  machines  or  networked  printers  is 
commonplace. 

The  desktop  publishing  metaphor  has  been  used  as  a 
mode!  to  predict  similar  transitions  in  other  media  indus¬ 
tries.  most  notably  Desktop  Video.  However,  the  transi¬ 
tion  has  not  occurred  at  the  pace  that  many  industry 


pundits  have  predicted.  This  is  due.  in  large  part,  to  the 
difficult  task  of  breaking  the  problem  up  into  manageable 
components:  to  create  separate  hierarchies  for  xquisi- 
tion.  transmission  and  dispby  of  motion  imagery. 

Intcropc.rability  of  video  systems  with  other  medb  is 
facilitated  by  a  complete  decoupling  of  the  acquisition, 
transmission  and  display  into  separate  hierarchies  for 
each  component.  Such  an  architecture  is  depicted  in  Fig¬ 
ure  3  J.  Scabble  representations  of  ^deo  will  be  enaUed 
by  this  decoupling,  and  technological  advances  in  one 
hierarchy  can  take  pbee  without  upsetting  the  apple  cart 
in  the  other  twa 

If  a  hierarchical  digital  imaging  architecture  is  used  as 
the  model,  a  digital  advanced  television  system  can  be 
implemented  that  is  equally  adept  in  delivering  low  cost 
solutions  that  conform  to  a  single  hierarchy,  as  well  as 
more  expensive  scabble  solutions  that  suppon  multiple 
hierarchies. 

The  acquisition  hierarchy  can  provide  image  capture  so¬ 
lutions  at  various  price/performance  points  that  are  ap¬ 
propriate  for  the  application.  Production  systems  can 
evolve  that  deal  with  single  image  formats,  or  multiple 
formats  within  the  hierarchy.  This  is  of  particular  import¬ 
ance  to  producers  of  program  content  with  significant 
archival  value.  Imagery  can  be  captured  at  a  higher  level 
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in  the  acquisition  hierarchy  with  an  eye  toward  disiribu* 
lion  at  one  or  more  of  the  lower  levels  of  the  transmission 
hierarchy:  the  archival  value  of  the  program  is  protected 
since  it  can  be  released  at  higher  quality  levels  in  the 
future  os  consumers  purchase  products  at  a  higher  level 
in  the  dispby  hierarchy. 

Viewing  transmission  as  a  hierarchy  is  critical  to  the 
concept  of  interoperability.  A  hierarchical  imaging  archi¬ 
tecture  would  support  a  progression  of  image  quality 
levels  that  are  interoperable  and  extensible,  attd  allow  for 
incTemental  improvements  in  image  quality  within  a  sin¬ 
gle  transmission  standard.  This  requires  the  use  of  a 
scabble  encoding  structure:  a  core  image  would  be  en¬ 
coded  at  the  ftrst  level  of  the  hierarchy,  and  enhancement 
information  would  be  encoded  for  each  of  the  higher 
resolution  levels  supponed  by  the  tnnsmission  standard. 

A  scalable  encoding  structure  may  be  more  difTiCult  to 
design  and  possibly  less  efficient  for  a  given  quality  level 
than  an  encoding  deigned  specifically  for  that  level.  It  has, 
however,  several  advantages  that  will  accrue  over  time: 

•  It  allows  receivers  to  be  deigned  at  different  quality 
(and  cost)  levels  and  still  share  components  and  a 

<  eommon  transmission 

•  It  could  alleviate  the  need  to  create  a  new  transm  ission 
standard  from  scratch  as  the  evolution  of  compression 
algorithms  permits  improvements  in  quality  (for  the 
same  bit  expenditure) 

•  It  can  conserve  bandwidth  when  image  quality  is 
added  incrcmenully  to  existing  transmissions,  rather 
than  creating  new  transmissions 

The  economic  benefits  associated  with  scalable  image 
encoding  will  be  significant.  The  emerging  consensus 
among  experts  in  video  compression  technology  is  that 
scalability  will  carry  a  minor  penalty  for  encoding  over¬ 
head.  Consider  the  impact  on  media  server  storage  sys¬ 
tems:  a  single  scalable  representation  will  make  more 
efficient  use  of  storage  than  multiple  copies  of  the  same 
material  at  different  scales. 

The  dispby  hierarchy  allows  for  a  variety  of  products  to 
evolve  at  various  price/pcrformancc  points  that  arc  ap- 
proprbte  for  the  application.  Some  display  systems  will 
evolve  to  single  performance  levels  while  others  will 
offer  multiple  levels  of  performance  within  the  transmis¬ 
sion  and  dispby  hierarchies. 

Scalability  pbys  a  major  role  in  the  design  of  decoder 
and  dispby  components.  If  the  transmission  system  de¬ 
livers  a  scalable  payload,  only  that  ponion  of  the  infor¬ 
mation  which  is  required  for  the  display  system  need  be 
decoded.  .A  small  pcrsoiuil  inform:iiion  system  may  only 
need  the  low  resolution  component  while  a  high-end 
home  entertainment  system  c:m  utilise  all  ol  the  restilu- 
tion  components. 


3.4  Factors  That  Have  the  Potential  to  Fun¬ 
damentally  Change  Digital  Image  Architec¬ 
tures 

Real  world  constraints,  cspccblly  with  respat  to  cost 
versus  performance,  arc  the  driving  factors  in  the  im¬ 
plementation  of  a  digital  image  archiiaiure.  In  determin¬ 
ing  the  requirements  for  the  architecture,  the  Task  Force 
has  analyzed  the  current  market  situation  as  well  as  tah- 
oology  and  regulatory  trends  that  may  reshape  the  mar¬ 
ket  in  the  next  few  daadcs. 

.  The  technology  for  image  aquisition  is  still  evolving 
rapidly  -  cameras  capable  of  delivering  the  highest 
levels  of  spatial  and  temporal  resolution  are  expen¬ 
sive.  It  is  anticipated  that  the  image  acquisition  mar¬ 
ket  will  continue  to  be  characterized  by  a  wide  range 
of  products  at  different  price/pcrformance  levels. 

•  The  storage  of  image  data  for  subsequent  processing 
or  viewing  is  a  major  component  of  the  expense  in 
most  imaging  systems  -  even  with  digiral  video  com¬ 
pression.  Storage  cost  is  diratly  lebted  to  the  data 
content  of  the  image  -  higher  performance  equates  to 
higher  system  cost. 

•  Transmission  channels  such  as  terrestrial  and  satellite 
broadcast  are  limited  by  regubtion  and  practical  tech¬ 
nology.  Networked  systems  (cable  and  telco)  are  lim¬ 
ited  by  the  installed  infrastructure  and  the  cost  of 
providing  additional  bandwidth  for  new  applications: 
while  available  bandwidth  has  increased  by  many 
orders  of  magnitude,  many  additional  orders  of  mag¬ 
nitude  must  be  xhieved  to  make  the  delivery  of  high 
resolution  imagery  as  generic  and  affordable  as  our 
telephone  service  is  today. 

•  When  viewing  imagery,  a  number  of  inter-related 
vari.nbles  affat  both  the  cost  and  performance  of  the 
display  system:  viewing  environment  (cspccblly  am¬ 
bient  light),  viewing  distance,  field  of  view,  and  the 
desired  resolution.  It  is  anticipated  that  the  display 
market  will  continue  to  be  characterized  by  a  wide 
range  of  products  at  different  price/pcrformance  lev¬ 
els. 

3.4.1  A  Shift  in  the  Pricing  Structure  of  Broad¬ 
band  Telecommunications 

Changes  in  the  regulatory  climate  arc  likely  to  cause 
increased  competition  among  all  networked  service  pro¬ 
viders  (telcos,  cable,  tkila  networks,  etc.)  and  encourage 
service  providers  to  urgn-ulc  the  quidiiy  and  capacity  of 
these  networks. 

The  current  pricing  structure  for  broad  band  telecommu¬ 
nications  is  typically  based  on  channel  bandwidth  -  the 
purchaser  u.scs  and  ptiys  for  the  entire  channel  regardless 
of  the  amount  of  iiifomuition  moved  through  it.  In  the 
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future,  greatly  increased  channel  bandwidth  and 
packetized  encoding  schemes  using  headers/descriptors 
for  packet  identification  will  cause  a  shift  in  pricing 
structure  -  the  purchaser  will  pay  only  for  the  informa¬ 
tion  content  that  moves  through  the  channel.  This  con¬ 
cept  when  applied  to  video  services  has  been  described 
as  pay-per-view-per-bit. 

This  shift  in  pricing  structure  is  likely  to  act  as  a  catalyst 
for  the  rapid  evolution  of  video  compression  techniques 
aitd  transmission  standards,  with  an  emphasis  on  two 
areas: 

•  Dau  rate  reduction  -  delivering  increasingly  higher 
levels  of  quality  while  holding  bit  rates  consunt 

•  Scalability  •  a  shift  toward  encoding  schemes  that 
allow  for  different  levels  of  image  quality  to  be  ex¬ 
tracted  from  the  encoded  digital  data 

During  the  transition  to  wide  bandwidth  communications 
channels,  data  rate  reduction  will  be  the  driving  force  as 
the  cost  per  bit  remains  relatively  high.  As  the  cost  per  bit 
declines,  the  emphasis  will  shift  to  scahbiliiy.  This  will 
be  due  largely  to  the  market  advantages  of  maintaining  a 
^  single  dau  file  that  can  be  delivered  to  a  wide  range  of 
users  at  different  levels  of  the  dispby  hierarchy. 

3.4.2  Programmable  Decoders 

Another  major  trend  that  is  anticipated  is  the  evolution 
from  fixed  single-stand.vd  decoders  to  programmable 
decoders  that  can  ad.’ipt  to  scabble  image  representa¬ 
tions.  Single-standard  decoders  will  be  used  primarily  for 
devices  that  up  into  the  communicauons  network  and 
deal  only  with  one  t)’pe  of  image  representation.  Pro¬ 
grammable  decoders  will  deal  with  families  of  standards. 
Fax  machines  serve  as  a  good  example  of  single  and 
multiple  sund.'ird  encodcr/decoders.  The  Group  1  fax 
standard  provided  a  single  level  of  resolution:  machines 
were  expensive  and  their  use  was  limited.  With  the 
addition  of  Group  3  fax  st.nnd;irds.  multiple  levels  of  reso¬ 
lution  were  supponed.  including  the  older  Group  1  format. 
Due  to  advanees  in  technology,  the  new  machines  were 
better  and  cheaper,  yet  compatible  with  the  existing  Group 
1  machines.  The  marketplace  responded  in  a  very  positive 
manner. 

Programmable  decoders  will  be  the  key  component  in 
providing  extensibility  to  the  digital  imaging  architec¬ 
ture.  Because  of  the  diversity  of  image  compression  stan¬ 
dards  (Croup  .1  fax.  H.261.  JPEG.  MPEG.  DVI.  etc.), 
these  decoders  will  play  an  imponant  role  in  the  integra¬ 
tion  of  video  and  high  resolution  imaging  with  dc.sktop 
computer  workstations.  This  .s:imc  diversity,  with  the  ad¬ 
dition  of  a  digital  televi.<;ion  sundard  (or  stand:ird.s)  will 
lead  toward  the  use  of  programmable  decoders  in  home 
entertainment  and  information  delivery  systems.  Essen¬ 
tially  Axed  solutions  will  drive  the  low  end  of  the  nuuket. 
providing  inexpeasive  mass  nurket  consumer  products. 


while  programmable  solutions  will  dominate  at  middle 
and  upper  levels  of  the  transmission  and  display  hierar¬ 
chies. 

3.4.3  Trends  in  Display  Technology 

The  use  of  scanning  CRT  display  technology  for  certain 
applications  is  expected  to  decline  over  the  next  decade 
as  LCD  based  direct-view  displays  and  projection  sys¬ 
tems  are  perfected.  LCD  dispbys  are  used  extensively 
today  in  portable  computers,  and  LCD  light  valves  for 
high  resolution  projection  are  showing  great  promise.  In 
a  light  valve,  the  LCD  is  used  to  control  the  amount  of 
light  -  from  a  flicker-free  light  source  •  that  can  pass 
through  each  pixel  location:  since  the  dispby  is  no  longer 
the  light  source,  significant  improvements  in  brightness 
can  be  achieved. 

The  characteristics  of  LCD  displays  are  signiflcantly  dif¬ 
ferent  from  flying-spot  scanning  CRT  dispbys.  Flying- 
spot  systems  must  operate  at  refresh  rates  above  the 
critical  frequency  for  flicker  fusion:  display  brightness  is 
limited  since  the  spot  is  the  only  source  of  illumination 
(most  of  the  dispby  is  decaying  at  any  point  in  time). 

Every  pixel  in  an  LCD  display  receives  constant 
illumination.  LCDs  can  be  characterized  as  having  long 
persistence:  in  fact,  a  significant  design  challenge  has 
been  to  provide  faster  pixel  response  to  deal  with  full 
motion  video.  This  has  been  accomplished  through  the 
use  of  a  transistor  at  exh  pixel  location  (an  active  matrix 
display),  providing  rapid  response  for  pixel  replenish¬ 
ment. 

The  nature  of  the  active  matrix  circuit  also  allows  a  pixel 
value  to  be  held  for  at  least  one  second  without  replenish¬ 
ment.  giving  the  display  characteristics  similar  to  a  frame 
buffer.  Direct  addressing  of  each  pixel  location  would 
make  it  possible  to  update  only  those  pixels  which 
change  from  one  refresh  period  to  the  next.  Transmission 
systems  that  utilize  digital  compression  techniques  to 
eliminate  interframe  image  redundancies  may  take  ad¬ 
vantage  of  these  aspects  of  LCD  displays  to  implement 
conditional  replenishment. 

3.4.4  Conditional  Replenishment 

A  signiflext  portion  of  the  dab  rate  reduction  xhieved 
by  digiul  image  compression  techniques  deals  with  the 
elimination  of  interframe  redundancies.  In  essence,  much 
of  the  complexity,  and  hence  the  cost,  of  these  encoding 
systems  involves  the  processing  required  to  analyze  m<y 
tion  image  data  streams  to  determine  which  pixels  have 
changed  between  temporal  samples. 

Over  the  next  10  to  15  years  im.'igc  acquisition  and  dis¬ 
pby  technologies  are  likeiy  to  move  to  conditional  re¬ 
plenishment.  Image  xquisitiun  systems  may  evolve  with 
on-lMard  digital  processing  to  implement  conditional 
image  xqui-siiion.  These  cameras  will  be  programmable. 
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orTcring  several  advantages  over  scanning  camens  that 
continuously  update  the  entire  image  raster,  including  the 
ability  to: 

•  Acquire  high  spatial  resolution  and  high  temporal 
resolution  dau  at  different  ntes 

•  Adjust  sensitivity  as  a  function  of  the  temporal  update 
rate 

•  Output  information  about  multiple  objects  within  the 
image  and  their  motion  vectors 

•  Relate  movement  of  the  sensor  (pan  and  tilt)  to  image 
content,  allowing  for  update  of  only  the  newly  ac< 
quired  portions  of  the  image 


Future  display  technologies  are  likely  to  evolve  around 
direct  view  displays  (possibly  LCD)  offered  in  different 
pixel  densities.  Direct  addressing  of  LCD  displays  will 
allow  the  use  of  conditional  refreshment  of  only  those 
pixels  that  change  from  one  refresh  period  to  the  next: 
the  display  itself  may  become  the  frame  buffer,  allowing 
portions  of  the  image  to  be  updated  at  different  temporal 
rates.  Or,  combined  with  an  appropriate  multi*poned 
frame  buffer  design,  such  a  dispby  could  support  mulii- 
pie  temporal  refresh  oics  simultaneously  for  difTcreni 
image  streams. 
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4  Critical  Issues _ _ 

4.1  Introduction 

The  Task  Force  has  idcntificil  scN-en  issues  thai  arc  con¬ 
sidered  critical  to  the  achievement  of  the  ohjcciives. 
Many  of  these  issues  arc.  by  their  nature,  complex. 

Backward  compatibility  to  existing  systems  and  extensi¬ 
bility  to  future  systems  present  many  technical 
challenges.  The  greatest  challenge  lies  in  preserving  the 
value  of  existing  infrastructures  while  enabling  an  or¬ 
derly  tnnsition  to  the  new  architecture.  For  example, 
immense  investments  have  been  made  in  the  acquisition 
and  transmission  infrastructures  of  our  existing  NTSC. 
PAL  and  SECAM  television  systems.  Likewise,  billions 
of  consumers  have  invested  in  receivers  and  video  re¬ 
corders  that  support  these  systems.  It  is  equally  critical 
that  investment  in  the  vast  archives  of  information  and 
entertainment  programming  that  exist  today  on  Him  and 
video  be  protected,  and  that  the  new  architecture  unlock 
the  economic  potential  of  these  archives. 

In  delibenting  on  these  critical  issues,  every  effort  has 
been  made  to  balance  the  interests  arising  from  those 
investments  with  the  future  benefit  to  all  of  a  single 
global  standard.  These  deliberations  have  also  taken  into 
consideration  the  insulled  bases  of  computer,  medical, 
engineering  and  scientific  imaging  systems,  and  the  di¬ 
verse  applications  for  still  imaging  in  electronic  publish¬ 
ing.  visual  databases  and  communications.  Existing 
systems  that  demonstrate  interoperability  and  extensibil¬ 
ity  •  including  some  which  have  in  fact  been  extended  - 
were  considered.  Examples  include  the  French  Minitel 
system  and  the  family  of  international  facsimile  sun- 
dards. 

The  seven  critical  issues  arc: 

•  The  Establishment  of  Scalable  and  Interoperable  Hi¬ 
erarchies  for  Basic  Image  Panmeters 

•  The  Establishment  of  an  Appropriate  Relationship 
Between  Image  Acquisition  and  Display  Refresh 
Rates 

•  The  Use  of  Square  Sampling  Grids  (Square  Pixels) 

•  The  Establishment  of  Appropriate  Representations 
for  Colorimetry.  Dynamic  Range  and  Transfer  Char¬ 
acteristics 

•  The  Use  of  Coherent  image  Sampling  (Progressive 
Scanning) 

•  IdcntincationoftlicCharactcristicsofaDigitai  Image 
Stream  (Hcadcrs/Dcscriptors) 

•  The  Estahl  ishment  of  Appropriate  Levels  of  Compat¬ 
ibility  with  Current  Television  and  Motion  Picture 
Siand.vds 


4.2  The  Establishment  of  Scalable  and  Inter¬ 
operable  Hierarchies  for  Basic  Image  Pa¬ 
rameters 

An  ideal  digital  image  architecture  would  allow  the  fol¬ 
lowing  image  parameters  to  be  independently  varied, 
over  a  range  of  appropriate  values: 

•  Spatial  resolution 

•  Field  of  view 

•  Viewing  distance 

•  Aspect  ratio 

•  Image  xquisition  rate 

•  Display  refresh  rate 

•  Dynamic  range 

•  Colorimetry 

While  this  independence  may  be  technically  feasiUe 
within  the  ftfty  year  life  span  desired  for  the  first  digiol 
imaging  architecture,  it  does  not  appear  to  be  practicable 
for  immediate  implementation,  nor  is  it  requirel  The 
choice  of  an  appropriate  hierarchy  for  each  these  param¬ 
eters  can  provide  adequate  degrees  of  freedom  for  sys¬ 
tem  design,  while  facilitating  affordable,  high  quality 
transcoding  between  the  levels  in  each  hierarchy. 

Scalable  and  interoperable  hierarchies  offer  many  bene¬ 
fits  when  communications  channel  issues  are  considered. 
Such  an  approach  promotes  effective  utilization  of  exist¬ 
ing  communications  channels  and  the  development  of 
new  broad  band  communication  services.  The  lower  lev¬ 
els  of  the  hierarchy  provide  solutions  for  the  capacity 
constrained  channels  that  exist  today.  The  introduction  of 
new  broad  band  communications  services  will  enable  the 
use  of  higher  data  rates  to  support  the  improved  perfor¬ 
mance  available  at  higher  levels  in  each  hierarchy. 

A  digital  image  architecture  that  provides  interoperabil- 
iiy  across  applications  with  different  ^tial  resolution 
n^uiremems  must  be  scalable  in  terms  of  resolution  as 
disoissed  in  Section  3  J.  Interoperability  also  requires  a 
family  of  related  image  acquisition  and  dispby  rates.  The 
greatest  benefit,  in  terms  of  cost  and  simplicity,  is  gained 
when  the  dispby  operates  at  the  same  rate  as.  or  an 
integer  multiple  of.  the  image  acquisition  rate.  Though 
more  expensive  to  implement,  the  greatest  performance 
benefit  is  gtiincd  when  motion  compensation  techniques 
are  used  in  encodcrs/dccodcrs  to  create  in-between 
frames  for  dispby.  Section  5.4  discusses  the  require¬ 
ments  fur  such  a  family. 
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To  faciliiaic  this  hierarchical  approach  lo  a  digial  image 
architecture,  a  scabble  approxh  to  image  coding  is  re¬ 
quired.  Furthermore,  improved  techniques  for  video 
compression  are  likely  to  be  enabled  by  the  geometric 
progression  in  computational  hardware.  The  design  of 
the  architecture  must  make  provisions  for  this  progres¬ 
sion.  Section  5J  discusses  the  use  of  scabble  coding 
algorithms. 

4J  The  Establishment  of  an  Appropriate 
Relationship  Between  Image  Acquisition  and 
Display  Refresh  Rates 

In  early  discussions  about  the  use  of  digital  codings  for 
HDTV  systems,  it  became  clear  that  receivers  would 
likely  need  one  or  more  Game  stores  to  implement  image 
decoding.  This  prompted  the  idea  that  image  acquisition 
fates  could  be  decoupled  from  dispby  refresh  rates  -  the 
dispby  could  be  refreshed  at  a  rate  that  is  an  integer 
multiple  of  the  acquisition  rate.  For  this  reason  the  ques¬ 
tions  of  image  acquisition  rates  and  display  refresh  rates 
will  be  considered  separately. 

No  topic  generated  as  much  discussion  in  the  Task  Force 
as  image  acquisition  and  display  refresh  rates.  This  is  due 
in  pin  to  the  divenity  of  rates  that  exist  in  the  sundards 
and  resulting  practices  within  each  of  the  affected  indus¬ 
tries.  The  issue  is  further  complicated  by  the  evolution  of 
television  down  parallel  paths  with  respect  to  Held  rates. 
Their  harmonizauon  will  require  solutions  that  lie  in  the 
realm  of  digital  technology  as  well  as  the  realm  of  poli¬ 
tics  and  negotiation. 

The  choice  of  an  image  acquisition  rate  is  a  tradcolT 
between  motion  rendition  and  the  resulting  data  rate.  The 
following  considerations  are  imporunt  in  establishing  a 
family  of  acquisition  rates. 

•  The  ability  to  adjust  the  rebtionship  of  motion  rendi¬ 
tion  and  dau  rates  to  meet  specific  application  re¬ 
quirements 

•  The  cost  and  quality  of  transcoding  between  acquisi¬ 
tion  rates 

•  The  family  should  include  24  or  25  fps  for  acquisition 
of  film  •  to  include  both  would  result  in  two  families 
of  image  acquisition  rates  with  poor  interoperability 

•  For  North  Ameria  a  family  based  on  12  fps  is  one 
possible  choice.  However,  if  agreement  were  possible 
on  a  global  standard,  a  12  J  fps  rebted  family  might 
be  acceptable  in  North  America,  recognizing  that  this 
would  require  a  4%  speed  change  for  acquisition  of 
24  fps  film. 

There  are  many  factors  aflccting  the  choice  of  a  dispLiy 
refresh  rate  including; 

•  Dispby  technology -CRT.  LCD.  laser,  etc. 


•  Size 

•  Brightness 

•  Lag 

•  Application 

•  Cost 

Refresh  rate  will  be  determined  by  the  above  criteria  and 
price/performance  requirements  established  by  market 
factors. 

Experience  has  shown  that  for  wide-screen  CRT  dispbys 
of  high  brightness,  a  refresh  rate  in  the  region  of  72  to  75 
Hz  is  required  to  xhieve  tolerable  levels  of  wide-area 
flicker  (see  Section  3.2.5).  In  some  situations  refresh 
rates  in  excess  of  100  Hz  may  be  desirable.  Receivers 
whbh  operate  at  100  Hz  (double  the  normal  SO  Hz  inter¬ 
laced  scan  rate)  are  being  introduced  in  the  50  Hz  mar¬ 
ket:  rate  doubling  receivers  operating  at  120  Hz  are  also 
being  developed  for  the  60  Hz  market 

The  rebtionship  of  display  refresh  and  image  update 
rates  should  be  based  on  a  progression  that  permits  non- 
interpobtive  transformations  between  the  xquisition 
and  dispby  rates  in  the  new  architecture  (U.,  dispby  at 
integer  multiples  of  the  image  update  rate).  As  an  exam¬ 
ple.  theauical  dispby  of  film  is  usually  double  or  triple 
Shuttered  to  minimize  wide-area  flicker  of  the  dispby. 

Further  research  into  the  choice  of  a  single  family  of 
xquisiu'on  rates  and  dispby  rates  is  required.  An  appro- 
prbte  interoperable  family  should  include  a  24  or  25  fps 
image  acquisition  rate  which  would  entile  a  72  or  75  Hz 
dispby  refresh  rate.  This  is  the  subject  of  further  discus¬ 
sion  in  Sections  5, 7  and  8. 

4.4  The  Use  of  Square  Sampling  Grids 
(Square  Pixels) 

The  computer  graphics,  image  processing,  and  publish¬ 
ing  industries  have  adopted  the  use  of  geometrically 
square  pixel  sampling  grids  (frequently  simply  referred 
to  as  square  pixels).  The  use  of  square  pixels  facilitates: 

•  The  interchange  of  image  data  across  standards 
boundaries 

•  Development  of  common  storage  and  transport  ser¬ 
vices  xross  industries 

•  The  use  of  common  equipment,  allowing  the  costs  to 
be  amonixd  across  all  users 

Early  on.  the  computer  graphics  industry  sought  ways  to 
insulate  applications  from  variations  in  dispby  txhnol- 
ogy.  Suppon  for  different  pixel  configurations  required 
run-time  transformation  of  all  graphical  objects.  Even 
then,  applications  rarely  looked  the  same  from  dispby  to 
dispby  because  different  pixel  configurations  caused  a 
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variety  of  onifacts.  These  stopgap  measures  constrained 
functionality,  reduced  performance,  and  added  cost  to 
equipment  and  services.  Ultimately,  this  approach  failed. 

Instead,  computer  graphics  gravitated  towards  a  common 
display  technology  based  on  square  pixels.  This  simpli* 
fled  system  design,  which  led  to  lower  cost  and  better 
performance,  enabled  equipment  and  services  to  be  used 
as  commodities  across  a  broad  set  of  industries.  Today 
the  computer  industry  is  a  major  consumer  of  displays, 
second  only  to  consumer  television  receivers. 

The  use  of  a  common  pixel  geometry  eliminates  the  need 
for  interpolative  resampling  when  sharing  imagery 
among  all  users.  Resampling  has  two  costs: 

•  It  can  be  lossy  •>  it  can  degrade  image  quality  and 
cannot  be  reversed  without  additional  loss 

•  It  is  costly  >  it  adds  a  computational  step  or  a  hardware 
equivalent  to  the  process  of  sharing  imagery 

Thus  the  adoption  of  a  common  sampling  grid  is  a  key 
issue  for  discussion  and  resolution  in  the  SMPTE  work 
towards  the  specification  of  a  digital  image  architecture. 

4.5  The  Establishment  of  Appropriate  Rep* 
resentations  for  Colorimetry,  Dynamic 
Range  and  Transfer  Characteristics 

The  concepts  of  interoperability,  scalability  and  extensi¬ 
bility  apply  not  only  to  the  sampling  of  the  image  but 
equally  to  the  expression  of  its  brightness  and  colorime¬ 
try.  A  digital  image  architecture  must  deal  appropriately 
with  dynamic  range  and  colorimetry  requirements  of  the 
acquisition  (including  processing),  transmission  and  dis- 
pby  modules  of  a  system.  During  the  acquisition  and 
processing  of  imagery  (for  example,  post-production), 
image  dau  that  may  not  be  required  by  the  human  visual 
system  or  reproducible  on  a  given  display  may  be  re¬ 
quired  by  processing  hardware  for  optimal  results.  Simi¬ 
larly.  the  architecture  must  accommodate  image 
exchanges  between  systems  having  differing  dynamic 
range  and  colorimetry  characteristics.  The  essential  is¬ 
sues  are  summarized  in  the  sub-sections  which  follow. 

4.5.1  Extensibility 

Existing  image  systems  can  reproduce  only  a  limited 
range  of  the  colors  visible  in  the  real  world,  often  re¬ 
stricted  to  those  corresponding  to  illuminated  objects  and 
the  specific  needs  of  the  application.  The  colorimetry  of 
television  is  currently  conilned  to  that  of  the  di^lay 
device.  Figure  4.1  is  a  color  space,  within  which  are 
illusuated  a  typical  red.  green,  blue  (RGB)  gamut  of 
additive  primary  colors,  and  a  typical  yellow,  cyan,  ma¬ 
genta  (YC.M)  gamut  of  subtractive  colors.  Also  shown  is 
the  hue  and  saturation  representation:  saturation  is  the 
radial  distance  from  a  specified  white  point:  hue  is  the 
associated  angle.  Hue  and  saturation  vectors  are  shown 


pointing  to  the  RGB  and  YCM  color  gamuts,  as  well  as 
one  vector  that  extends  beyond  both.  It  can  be  seen  that 
this  representation  can  be  used  to  represent  any  visible 
color,  iliis  color  space  is  application  and  device  indepen- 
denu 

In  the  future  it  may  be  possible  and  desirable  to  extend 
the  colorimetry  represenration  to  include  a  wider  range 
of  colors,  possibly  even  iiKluding  those  of  self-luminous 
objects,  as  one  example.  A  close  examination  of  this 
issue  is  needed  to  esrablish  the  range  of  colors  to  be 
represented  within  the  colorimetry  of  the  digital  image 
architecture. 

A  similar  situmion  to  that  of  colorimetry  exists  for  the 
lepresenution  of  dynamic  range  transfer  function.  Cur* 
rent  systems  are  individually  optimized  for  the  cunent 
technology  and  applicauon  and  are  not  easily  amenable 
to  an  increase  in  dynamic  range.  Mechanisms  to  effec¬ 
tively  handle  a  much  wider  dynamic  range  need  to  be 
identified. 

4.5.2  Scalability 

To  cover  the  intended  range  of  application,  it  is  necessary 
that  the  color  and  dynamic  range  representations  be  ca¬ 
pable  of  being  scaled,  preferably  independently.  FCr  in¬ 
stance,  the  display  of  an  image  having  a  wide  color 
gamut  at  the  source  must  produce  acceptable  color  on  a 
display  of  limited  color  cs^ility.  The  reverse  situation 
is  also  true.  Similarly,' the  display  of  an  image  of  high 
dynamic  range  should  not  loose  essential  information 
when  viewed  on  a  dispby  of  low  dynamic  range.  The 
representation  must  accommodate  these  requirements  ef¬ 
ficiently. 

The  situation  is  somewhat  similar  to  that  of  motion  , 
ture  film  in  which  the  latitude  of  the  negative  film  en¬ 
ables  exposure  and  color  adjustment  after  the  image 
capture,  and  the  5-curve  of  the  film  characteristic  pro- 
vides  effective  compression  of  the  highlights  and  dark 
regions.  Similar  provisions  may  be  required  in  digital 
image  systems  to  provide  reasonable  represenutions  for 
both  small  and  large  numbers  of  bits.  A  further  consider¬ 
ation  may  concern  the  optimal  distribution  of  any  neces¬ 
sary  cofflpression/expansion  in  respect  to  overall  image 
qudity. 

Interoperability 

Interoperability  demands  that  the  diosen  colorimetric 
representation  and  the  dynamic  range  representation  be 
device  independem  for  current  and  future  devices.  In  this 
fashion,  devices  supponing  differing  colorimetries  and 
dynamic  ranges  can  supponed. 

It  is  also  important  that  images  of  differing  colorimetry 
and  dynamic  range  at  the  acquisition  device  should  be 
able  to  be  combined  effectively  into  a  single  image, 
when  appropriately  scaled. 
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The  color  space  and  dynamic  range  representations  that 
could  meet  these  objectives  require  extensive  consider¬ 
ation.  Seaion  7.7  includes  a  number  of  questions  that 
should  be  considered  in  the  analysis  of  these  and  other 
colorimeny  issues. 

4.6  The  Use  of  Coherent  Image  Sampling 
(Progressive  Scanning) 

Kisioricalty.  interlace  has  been  used  to  achieve  a  2:1 
reduction  in  bandwidth  requirements  (i.e..  data  rate),  and 
to  eliminate  wide-area  flicker  on  scanning  CRT  displays. 
The  use  of  progressive  scanning  is  nearly  univer^  in 
computer  display  applications,  and  is  employed  in  some 
high  quality  video  presentations. 

In  section  3.15  it  was  established  that  higher  scanning 
rates  are  required  with  displays  that  cover  a  wider  field  of 
view  and/or  operate  at  higher  levels  of  brightness  than 
today's  television  systems.  Decoupling  the  refresh  rate  of 
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the  display  from  the  image  update  rate  provid«  a  mech¬ 
anism  to  deal  with  wide-area  flicker  —  this  is  discussed  in 
seaion  4.3. 

‘There  are  two  methods  of  sampling  a  moving  image  that 
generate  coherent  image  samples: 

•  Traditional  proeressive  scanning  samples  the  image 
from  top  to  bo’ttom.  The  bottom  of  the  picture  is 
sampled  later  in  time  than  the  top  of  the  picture,  but 
in  any  small  area  the  samples  may  be  regarded  as 
temporally  coincident. 

•  Shuttered  devices,  such  as  film  and  most  CCD  sen¬ 
sors.  expose  all  image  elements  simultaneously.  Pro¬ 
gressive  scanning  of  film  by  an  electronic  image 
sensor  thus  results  in  coherent  image  samples.  The  use 
of  CCD  image  senson  which  employ  electronic  shut¬ 
tering  provides  a  similar  mechanism  to  film:  the  ana¬ 
log  output  of  the  CCD  can  be  s^pled  progressively 
as  the  coherent  image  data  is  shifted  out  of  the  CCD. 
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Coherent  imagery  provides  the  following  benelits  over 
interlxe: 

•  Diminates  smaiNarea  flicker  in  high  spatial  resolu> 
tion  imagery 

•  Improves  the  performance  of  digital  video  compres* 
Sion  and  many  other  processes 

•  Promotes  interoperability  by  improving  all  interpob* 
dve  processes 

•  Improves  the  quality  of  still  images  generated  from 
individual  frames 

'  Fine  horizontal  lines  or  edges  are  commonly  used  in 
computer  graphics.  Hiese  objects  do  not  flicker  when 
viewed  on  a  progressively  scanned  computer  display,  but 
may  flicker  if  viewed  on  an  interbced  television  dispby. 
Many  processes  -  including  bandwidth  compression, 
image  transformation,  and  standards  conversion  >  are 
faciliuted  by  coherent  imagery. 

The  use  of  coherent  image  samples  simplifies  the  process 
of  temporal  and  spatial  conversions  required  for  inter* 
operability  between  different  video  systems,  as  well  as 
with  other  applications  and  industries.  The  current  ptac* 
tice  in  standxds  conversion  for  interbced  video  systems 
is  to  first  de-interbee  the  imagery.  De-interixing  in¬ 
volves  the  reconstruction  of  the  missing  alternate  xan 
lines;  this  usually  requires  the  computation  of  motion 
vxtors. 

Interlxe  provides  acceptable  motion  rendition  at  lower 
data  rates  than  progressive  xanning,  and  may  improve 
xnsitivity  in  xme  camera  designs.  With  the  use  of  ap* 
prophaie  de-interbeing  equipment  in  the  studio  environ¬ 
ment.  images  acquired  with  interbced  systems  may  be 
converted  to  progressive  xan  for  subsequent  processing. 
This  tnnsformation  produces  the  best  results  when  the 
inierlxed  field  rate  and  the  progressive  frame  rate  are  the 
sme  or  are  rebted  by  integer  multiples. 

Coherent  sampling,  along  with  the  ux  of  mxhanical  or 
electronic  shutters,  can  improve  both  motion  video  and 
still  image  acquisition,  allowing  video  cameras  to  be 
used  for  a  bruader  nnge  of  applications. 

The  ux  of  progressive  scanning  is  xen  by  some  as  a 
requirement  for  a  digital  image  architecture,  especially 
when  interoperation  across  applications  and  industries  is 
a  requirement. 

4.7  Identification  of  the  Characteristics  of  a 
Digital  Image  Stream  (Headers/Descriptors) 

A  fundamenul  prexquisite  for  interoperability  in  digiul 
systems  is  a  mxhanism  for  identifying  and  describing 
digital  image  data.  For  this  information  to  be  shared, 
dxoders  must  be  capable  of  identifying  and  conforming 
to  the  incoming  data.  Even  simple  dxoders  -  thox  that 


only  recognix  a  sbgle  standard  >  must  identify  data 
streams  which  they  can  decode.  This  is  one  of  the  pri¬ 
mary  fuxtions  of  the  header.  Decoders  must  also  ignore 
unrecognized  data,  to  allow  for  extensions  to  the 
stream. 

Dexriptors  provide  application  oriented  information, 
such  X  image  and  coding  parameters,  processing  history, 
identification  of  program  content,  copyright,  and  scram¬ 
bling.  They  also  enable  extensibility*,  the  descriptor  may 
also  contain  the  coding  algorithm  or  language  reprexn* 
lotion  nxessary  to  interpret  the  encapsulated  data.  This 
provides  a  mxhanism  whereby  expert  groups  can  create 
and  standardbe  the  transmission  of  messages  to  meet 
their  needs. 

Descriptors  may  be  uxd  to  identify  and  describe  daa  at 
different  levels  of  an  image  hierarchy,  thus  allowing  a 
dispby  system  to  dxode  only  that  part  of  a  stream  nec¬ 
essary  for  its  function  or  capability.  Dexriptors  might 
also  contain  information  about  the  preferred  diqtby 
characteristics  for  imagery. 

Thus  information  sxh  as  the  colorimeov  of  the  original 
acquisition  system,  and  the  transfer  charxieristics  of  the 
process  uxd  to  move  images  from  one  medium  to  an¬ 
other.  can  be  included  with  the  data.  Dxoders  would  ux 
this  informxion  to  optimize  display  of  the  image. 

The  SMF7S  Task  Force  on  Headeis/Dexriptors  in  their 
Final  Report  dated  January  3. 1992.  and  approved  by  the 
SMPTE  Standards  Commiux  on  February  6.  outlined 
the  criterb  for  the  ux  of  Headers/Dexriptors.  Work  is 
now  progressing  on  the  development  of  proposed 
SMPTE  Standards.  Rxommended  Practices  and  Engi- 
nxring  Guidelines. 

4.8  Compatibility  with  Current  Television 
and  Motion  Picture  Standards 

The  installed  bax  of  NTSC.  PAL  and  SECAM  equip¬ 
ment  within  the  program  production  community,  to- 
ge'.her  with  massive  consumer  investment  in  compatible 
rxeivers  and  VCRs,  must  be  supported  in  the  transition 
to  a  digital  image  archiixture.  Of  even  greater  impon- 
ance  is  the  requirement  to  preserve  the  value  of  the 
archives  of  programs  that  have  been  created  for  mass 
market  disuibuiion  using  thex  systems  and  to  expbit 
thex  resources  to  the  greatest  extent  possible  in  the 
future. 

This  is  by  far  the  most  critical  issue  of  all,  so  mxh  so 
that  its  impxi  is  clear  in  the  dixussions  of  many  of  the 
previous  issues.  Only  the  last  of  them,  the  ux  of  head- 
ers/dexripiors.  is  without  prexdent  in  existing  enter¬ 
tainment  industry  practice.  It  is  precisely  where  a 
dichotomy  exists  in  current  prxtice  that  the  greatest 
controversy  arises  -  on  the  issue  of  temporal  rates. 
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The  convergence  in  being  digital  may  provide  the  solu* 
i»ns  which  will  resolve  the  temporal  rate  issue;  conver¬ 
gence  around  the  common  language  of  digital  coding,  the 
progression  in  Q>U  performance,  and  the  ability  to  de¬ 
sign  inexpensive  modular  interfaces  in  the  form  of  mass 
produced  microchips. 


It  is  likely  that  a  number  of  solutions  will  evolve  to 
facilitate  interoperability  between  the  existing  world  of 
nim  and  analog  television,  and  the  new  digital  image 
architecture.  These  solutions  should  provide  a  variety  of 
price/performance  options  appropriate  to  the  applica¬ 
tions  requirements. 
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5  An  Example  of  a  Hierarchical  Digital  Image  Architecture 


This  section  suggests  a  technology  transparent  hierarchy 
-  one  compatible  with  the  present  and  extensible  for  the 
future. 

To  illustrate  the  model.  speciHc  numbers  have  been  cho* 
sen  that  take  advanuge  of  the  mathematical  relationships 
discussed  in  Section  4.  as  well  as  the  architectures  of 
digital  memory  and  processing  components.  These  num¬ 
bers  are  not  intended  as  the  basis  for  a  standard,  but 
rather,  provide  a  starting  point,  from  which  the  validity  of 
the  architectural  concepts  can  be  verified.  Further  work 
is  required  for  verification  of  the  model  and  determina¬ 
tion  of  the  exact  numbers  upon  which  a  standard  can  be 
based  (see  Section  7). 

The  following  parameters  of  a  hierarchical  digital  image 
atchitecaire  are  discussed  in  this  section; 

•  Open  Architecture 

•  Multiple  Spatial  Resolutions 

•  Image  Acquisition  Rates  and  Dispby  Refresh  Rates 

•  Scalable  Coding  Algorithms 

5.1  Open  Architecture 

In  Section  3  it  was  indicated  that  the  opportunity  exists  to 
design  an  open  digital  image  architecture  based  on  ge¬ 
neric.  inexpensive,  and  increasingly  powerful  digital 
components. 

For  a  digital  image  architeaure  to  be  cast  as  an  open 
system,  two  steps  are  required: 

•  Modularization  into  acquisition,  transmission  and 
display 

•  Defining  standard  interfaces  between  these  modules 

There  must  be  a  systems  engineering  of  the  standards  so 
that  the  modules  work  together.  There  are  two  basic 
interface  defmitions  to  be  publicly  standardized: 

•  Headers/descriptors  -  a  data  definition  interface  be¬ 
tween  the  acquisition  and  display  modules. The  head¬ 
ers/descriptors  issue  (Section  4.7)  attempts  to  address 
how  the  acquisition  and  display  segments  interact. 

•  Communications  service  definition  •  this  interface 
defines  how  the  acquisition  and  dispby  modules  sate 
communications  requirements  to  the  transmission 
module.  This  is  the  vehicle  for  an  end  sub-system  to 
sate  its  communications  requirements  (btency,  bit 
rate,  error  budget,  synchronization,  interactivity, etc.) 
to  the  uansmission  system  and  for  the  transmission 
sub-system  to  inform  the  display  sub-system  of  these 
parameters. 


Some  of  the  parameters  that  should  be  pan  of  this  com¬ 
munications  service  definition  include: 

•  Error  tolerance  -  some  dan  types  and  some  applicak- 
tions  require  bit-perfect  delivery.  Headers/descriptns 
and  database  transactions  are  obvious  examples.  On 
the  other  hand,  some  video  and  audio  applications 
may  be  able  to  stand  some  noise  -  how  much  being  a 
human  perceptual,  engineering  and  economic 
tradeoff. 

•  Delivery  guarantees -some  applications  require  leli- 
able  delivery,  where  lost  packets  are  retried  antO 
delivery  is  xknowledged.  Others,  such  as  live  view¬ 
ing  or  remote  dispby,  requin  that  lost  packets  not  be 
recovered. 

•  Latency  -  the  amount  of  tolerable  time  between  ac¬ 
quisition  and  dispby,  in  an  acuve  system 

•  Synchronized  service  requirement  -  multiple  types  of 
<iaM  that  need  to  be  delivered  together 

This  careful  modularization  encapsubtes  other  lanes, 
itKluding  the  critical  issues  discussed  in  Section  4,  so 
that  they  can  be  addressed  one  by  one. 

It  can  be  argued  that  there  is  no  need  for  rigid  arehiiBe- 
tural  standards  in  a  digital  world;  chat  programmability  in 
the  transmission  and  display  hierarchies  provides  a  suGB- 
cient  basis  for  interoperability.  Perhaps  some  day  this 
will  be  true.  If  the  goal  of  longevity  for  the  first  digital 
image  architecture  is  achieved,  it  is  likely  that  the  design¬ 
ers  of  the  next  imaging  architecture  will  be  less  con¬ 
strained  than  we  are  today. 

The  first  digital  image  architecture,  however,  must  pre^ 
vide  a  bridge  from  the  closed  systems  of  the  past  to  the 
open  systems  of  the  future.  The  fundamental  structure  of 
the  digital  building  blocks  and  economies  of  scale  asst^ 
cbted  with  sandardization  suggest  that  the  organizations 
charged  with  esublishing  these  standards  work  in  har¬ 
mony. 

S2  Designing  Display  Systems  to  Deal  with 
Multiple  Spatial  Resolution  Requirements 

The  perceived  resolution  of  a  dispby  is  determined  pri¬ 
marily  by  the  viewing  disance  and  the  visual  acuity  of 
the  observer.  Visual  acuity  is  often  determined  using  sets 
of  alternating  bbek  and  white  lines  of  equal  width.  Oie 
biack/white  line  pair  represents  one  cycle.  The  number  of 
cycles  that  can  be  resolved  across  one  degree  of  the  eye's 
viewing  field  is  lypicaiiy  used  as  a  measure  of  human 
visual  acuity,  and  is  soted  in  cycles  (line  pairs)  per  te- 
gree.  Under  some  conditions,  with  high  conuast  line 
pairs,  human  visual  acuity  extends  beyond  40  cycles  per 
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degree;  approximately  22  cycles  per  degree  is  perceived 
as  a  sharp  image. 

If  the  resolution  of  a  display  is  held  consunt  and  the 
viewing  distance  is  a  variable,  the  resolution  perceived 
by  the  viewer  -  measured  in  cycles  per  degree  -  will 
increase  as  the  viewer  moves  away  from  the  dispby. 
Therefore,  all  dispbys  can  be  considered  to  be  high  reso> 
lution  if  viewed  tom  an  appropriate  distance. 

At  a  distance  that  varies  with  the  visual  acuity  of  each 
individual,  the  aaual  resolution  of  the  display  equals  the 
limit  of  that  viewer’s  ability  to  resolve  image  detail. 
Beyond  this  viewing  distance  additional  image  detail 
cannot  be  perceived:  that  is.  the  dispby  has  more  resolu¬ 
tion  than  is  required  for  this  viewer  and  set  of  viewing 
conditions. 

In  some  eases  excess  resolution  may  be  desirable.  For 
example,  the  opentor  of  a  personal  computer  can  typi¬ 
cally  reduce  the  viewing  distance  to  a  high  resolution 
desktop  dispby  by  one-half,  simply  by  leaning  forward, 
thus  taking  advanuge  of  additional  resolution.  In  this 
example  perceived  resolution  improves  enough  to  be 
significant,  while  moving  15  inches  in  a  movie  theatre 
would  have  little  effect  on  perceived  resolution. 

The  NTSC  transmission  standard  was  designed  to  pro¬ 
vide  a  resolution  of  approximately  21  cycles  per  degree 
over  a  viewing  field  of  just  under  11  degrees.  Display 
sixe  can  be  a  variable  in  today's  television,  ranging  tom 
a  dbgonal  of  a  few  inches  (a  personal  dispby)  to  more 
than  30  feet  (direct  view  displays  in  sudiums  and  projec¬ 
tion  dispbys  in  controlled  lighting  environments).  These 
displays  differ  only  in  the  size  of  their  pixels.  At  the 
appropriate  viewing  distances,  the  perceived  resolution 
of  the  personal  display  and  the  stadium  dispby  will  equal 
the  design  goal  of  2 1  cycles  per  degree,  and  both  displays 
will  cover  11  degrees  of  the  observer's  field  of  view. 

Many  dispby  applications  require  higher  levels  of  per¬ 
ceived  resolution.  To  increase  the  level  of  perceived  res¬ 
olution.  while  holding  viewing  distance  constant, 
additional  samples  of  the  same  image  must  be  added, 
increasing  pixel  density.  To  cover  a  wider  field  of  view, 
as  in  wide-screen  dispbys.  holding  the  same  viewing 
distance  and  perceived  resolution,  new  information,  at 
the  same  pixel  density,  must  be  added  to  extend  the 
picuire. 
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Section  3.2  identified  the  need  for  a  variety  of  image 
resolutions  to  deal  with  specific  imaging  requirements. 
These  ranges  can  now  be  further  defined  in  terms  of  field 
of  view  and  resolution  in  cycles  per  degree. 


With  personal,  home  entertainment  and  theatre  displays, 
the  viewer  can  vary  the  disonce  tom  the  display,  aind 
thus  vary  the  perceived  resolution,  over  a  significant 
range  (see  Figure  5.1).  Taking  into  account  the  variations 
in  xuity  in  the  popubtion.  and  variations  in  viewing 


distance  for  each  application,  it  is  common  practice  to 
design  a  display  system  for  the  average  viewing  condi¬ 
tions  in  each  application.  The  overlaps  in  cycles  per 
degree  between  low,  normal  and  high  resolutions  are 
shown  in  the  table  to  xcouru  for  these  variations. 

Resolution 

Cycles  per  Degree 

Low 

1-15 

Normal 

10-25 

High 

20-30 

Ultra  High 

30-40 

A  special  case  exists  for  head  mounted  disp^y$  which 
provide  a  fixed  viewing  distartce:  here  the  display  manu- 
fxiurer  must  select  the  level  of  resolution  appropriate 
for  the  application  and  then  design  for  a  specific  per¬ 
ceived  resolution. 

Using  these  guidelines,  a  high  resolution  display  de¬ 
signed  for  a  35  degree  field  of  view  would  require  about 
two  thousand  pixels  per  line  at  30  cycles  per  degree.  In  a 
desktop  computing  application  where  the  viewer  is  30 
inches  tom  the  dispby.  the  length  of  an  active  line 
(dispby  width)  would  be  about  19  inches.  In  an  enter¬ 
tainment  application,  such  as  a  consumer  television  re¬ 
ceiver  viewed  tom  a  disunce  of  108  inches  (9  feet),  the 
length  of  an  active  line  would  be  about  68  inches. 

These  examples  are  illustrated  in  Figure  52.  In  this  fig¬ 
ure  the  principles  described  in  this  section  are  used  to 
illustrate  the  relationships  between  the  four  resolution 
levels  of  the  model  hierarchy  and  a  variety  of  dispby 
applications.  The  numbers,  especblly  as  they  rebte  to 
image  size  (in  pixels)  are  entirely  relative;  they  serve 
only  as  examples  of  the  pixel  count  required,  at  average 
viewing  distances  and  fields  of  view,  to  achieve  the  spec¬ 
ified  perceived  resolution. 

It  is  imponant  to  note  that  seemingly  diverse  applications 
such  as  personal  computer  and  home  entertainment  dis¬ 
pbys  will  have  similar  tesoluu'on  requirements  as  the 
size  of  the  home  enienaiiunent  dispby  increases  beyond 
the  narrow  field  of  view  of  today's  television  receivers.  It 
is  also  important  to  note  that  direct  view  GIT  displays 
(which  are  currently  limited  to  around  40  inch  dbgonals) 
require  resolution  in  the  normal  range  for  home  enter¬ 
tainment  applications. 
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Figure  5.1  -  Relative  Tile  Resolutions  -  These  groups  of  letters  represent  the 
relative  resolution  for  each  level  of  the  hierarchy  from  Level  1  (top)  to  Level  4.  To  better 
understand  the  practical  application  in  displays,  place  this  figure  where  it  can  be  viewed 
from  a  distance  of  between  30  inches  and  15  feet.  Level  4  should  be  sharp  at  30  inches; 
as  you  move  away  each  level  lower  in  the  hierarchy  will  become  sharp. 
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Rtsolutlon  Awrage 

Uvel  f;®"* 
of  View 


Level  1 
Low 

Resolution 


Level  2 
Normal 
Resolution 


Level  3 

High 

Resolution 


Level  4 
Ultra  High 
Resolution 


Average 

Target 

Viewing 

Display 

Distance 

H  X  V  (rounded) 

30' 

irxr 

76  cm 

30  X  1 8  cm 

tor 

44"x2r 

274  cm 

112  X  64  cm 

30" 

i2"xr 

76  cm 

30  X  1 8  cm 

108" 

44"  X  25" 

274  cm 

112  X  64  cm 

30" 

19"x11" 

76  cm 

48  X  28  ctn 

108" 

6r  X  38" 

274  cm 

1 73  X  97  cm 

360"  ! 

ZlTxMT 

914  cm 

577  X  323  cm 

108" 

101"  X  57^ 

274  cm 

257  X  145  cm 

360" 

336"  X  189" 

91 4  cm 

853  X  480  cm 

30" 

28"x  16" 

76  cm 

71  X  41  cm 

108" 

101"  X  57" 

274  cm 

257  X  145  cm 

360" 

336"  X  189" 

914  cm 

853  X  480  cm 

Pixels 

Hx  V 

(square) 

Approximate 

Resolution 

cycles/degree 

512 

X 

10 

288 

1024 

X 

20 

576 

2048 

X 

1152 

30 

20 

4096 

X 

40 

2304 

Field  of  View 


AVERAGE  VIEWING  DISTANCE 

Distance 

Applications 

30" 

76  cm 

Personal  and  Computer  Displays 

108" 
274  cm 

Home  Entertainment  Systems 

360" 
914  cm 

Theatrical  Display  and 
Business  Presentation 

Viewer 


Figure  5.2  -  Example  16x9  Aspect  Ratio  Displays 
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5.3.1  Key  Concepts  of  the  Model 

The  example  spatial  resolution  hierarchy  is  designed 
around  a  few  basic  concepts: 

•  An  integer  progression  of  hierarchy  levels  based  on 
commonly  used  digital  processing  and  memory  archi* 
lectures 

•  Support  for  a  variety  of  aspect  ratios  and  spatial 
resolutions  based  on  the  concept  of  image  tiles 

•  The  ability  to  construct  displays  for  vinually  any 
application  requirement  firom  tiles  of  the  appropriate 
resoludon 

The  hierarchy  progression  is  based  on  the  use  of  integer 
values  related  by  powers  of  two.  Essentially,  at  each 
higher  level  of  the  hierarchy,  resolution  doubles  (e.g.,  1. 
2. 4, 8,  etc.);  subsets  of  the  lowest  level  can  be  derived 
similarly  (1/2. 1/4, 1/8.  etc.). 

It  is  noteworthy  that  such  sequences  also  appear  in  the 
computer  processor  and  memory  component  industry. 
This  approach  takes  full  advantage  of  the  genetic  build* 
ing  blocks  that  are  the  driving  force  in  the  transition  to  a 
digital  world. 

In  order  to  provide  continuity  between  the  various  reso¬ 
lution  levels  of  the  hierarchy,  the  model  is  based  on  the 
concept  of  an  image  die.  For  the  purposed  of  this  discus¬ 
sion.  a  tile  can  be  considered  to  be  a  constant  pordon  of 
an  image,  represendng  the  same  pan  of  the  image  regard¬ 
less  of  the  resolution  level  or  image  size.  Thus,  at  each 
higher  level  in  the  hierarchy,  the  resolution  within  a  tile 
doubles  in  each  axis.  This  is  illustrated  in  Rgure  5  J. 

The  power  of  two  progression  may  now  be  applied  to 
determine  the  resolution,  in  pixels,  for  each  level  in  the 
hierarchy. 


Uvel 

Name 

Resolution 
in  Cycles 
perEiegree 

Pixels  b 
One  Tile 

Pixels  in  32 
X  32  Tile 
Superset 

1 

Low 

1-15 

16  X  16 

512x512 

2 

Normal 

10-25 

32x32 

1024  X  1024 

3 

High 

20-30 

64x64 

2048  X  2048 

4 

Ultra 

High 

30-40 

128x128 

4096  x  4096 

In  this  model  a  die  represents  an  area  equal  to  l/32nd  of 
the  image  at  any  level  of  the  hierarchy.  Thus  each  level 
consists  of  a  32  x  32  set  of  dies  (see  Figure  S  J).  The 
selection  of  this  fraction  for  a  die  is  arbitrary;  it  was 
chosen  because  it  is  a  convenient  building  block  -  integer 


multiples  can  be  used  to  construct  displays  at  all  of  the 
aspect  rados  and  spatial  resoludons  discussed  in  the  model 

5.3.2  Consoruction  of  Displays  from  Tiles  of  the 
Appropriate  Resolution 

The  Dble  in  Figure  |ROvides  a  matrix  of  display 
aspect  rados  and  resolutions  that  can  be  derived  from  the 
full  set  of  32  X  32  tiles  at  each  level  Since  the  die  size  is 
a  constant,  each  column  represents  a  constant  size  dis¬ 
play  at  four  perceived  levels  of  resolution. 

The  diagram  in  Figure  S3  establishes  several  important 
leladonships  that  provide  a  bridge  to  the  past  and  illustr¬ 
ate  how  interoperability  can  be  achieved: 

•  Region  A  (43)  shows  the  reladve  ^dal  resoludon 
of  S2S  line  systems. 

•  Region  B  (4:3)  show  the  relative  spatial  resoludon  of 
625  line  systems. 

•  Region  C  (16:9)  is  representative  of  the  dispby  that 
some  industry  observers  expect  will  be  used  in  the 
next  generation  of  home  entertainment  systems. 

•  Region  D  (4:3)  is  commonly  nsed  in  computer  and 
entertainment  systems.  R  allows  the  incorporation  of 
regions  A.  B,  and  C.  which  correspond  to  traditional 
video  and  film  aspect  rados.  along  with  additional 
areas  for  the  display  of  Graphical  User  Interface 
(GUI)  elements.  It  is  important  to  note  that  the  1024 
X  768  and  640  x  480  resoludons  •  shown  for  level  2 
-  are  both  popular  dispby  resoludons  for  personal 
computer  systems. 

•  Region  E  (1:1)  is  emerging  as  a  new  display  aspect 
lauo  for  certain  applications.  The  FAA  recenUy  began  * 
procurement  of 2048  x  2048  square  pixel,  progressive 
scan  QIT  dispbys  for  air  traffic  control  systems. 

The  tile  concept  can  similarly  be  tqrpUed  to  the  manufac¬ 
ture  of  dispbys.  In  this  case,  a  physical  di^by  tile  would 
correspond  to  a  conceptual  die  and  would  have  different 
physical  sizes  for  different  size  dispbys  and  difTerent 
pixel  densities  for  different  resoludon  requirements. 
Similarly,  dispbys  of  different  aspect  rados  could  be 
constructed  by  the  selecdon  of  the  appropriate  concep¬ 
tual  dies  as  shown  in  Rgure  53. 

Thus,  using  dies  and  only  four  resoludon  levels,  it  is 
possible  to  construct  a  dispby  for  virtually  every  possi- 
bb  application:  furthermore  this  dispby  can  also  be  used 
10  show  imagery  from  other  levels  of  the  hienrchy.  This 
is  especially  pracdcal  if  a  scabble  coding  architecture  is 
implemented  that  conforms  to  the  same  resolution  pro¬ 
gression. 
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TILE 

RESOLUTION 

NUMBER  OF  TILES  IN  DISPLAY 

IN  PIXELS 

20  X  15 

24  X18 

32X18 

32X24 

32X32 

R«latlv«  tile  resolutions  are 

4:3  Subset 

4:3  Subset 

16:9  Subset 

4:3 

1:1 

shown  below 

Region  A 

Region  B 

Region  C 

Region  0 

Region  E 

LEVEL  1 

H-16 

H*320 

H  =  384 

H*512 

H.512 

H.512 

Low 

X 

X 

X 

X 

X 

X 

Resolution 

V«  16 

V  *  240 

V.288 

V.288 

V.384 

V.512 

LEVEL  2 

H«32 

H  m  640 

H  .768 

H»  1024 

H.1024 

H.1024 

Normal 

X 

X 

X 

X 

X 

X 

Resolution 

V«32 

V«460 

V.576 

V.576 

V.768 

V.1024 

LEVEL  3 

H«64 

H.1280 

H  «  1536 

H  .  2048 

H  .  2048 

H  .  2048 

High 

X 

X 

X 

X 

X 

X 

Resolution 

V»64 

V.960 

V*  1152 

V.  1152 

V.  1536 

V  .  2048 

LEVEL  4 

H*  128 

H*  2560 

H  «  3072 

H  «  4096 

H  «  4096 

H  «  4096 

Ultra  High 

X 

X 

X 

X 

X 

X 

Resolution 

V.  128 

Vs  1920 

V  s  2304 

V  .  2304 

V  .  3072 

V  .  4096 

E  32X32 

ffiWFF 


D  32  X  24 


iB-24>X'18i 


A  20  X  15 


B5BBin 

HBBBBI 

BBBaai 

dBBsai 

BBBBBI 


III 

HBaSBesI 

HBBSBEfSl 

HBBBBeBl 

hbbbbbeI 

hbeibssibI 

nBasBoal 
l■BaBBeB| 
[B  BtB  BBSSiSBl 
IBBBBC!OE!l 

■BBaQesil 
BBBBBei 

■  BBOBESal 

■  BBSBiiBi 
IBBB  BSBSl 


Figure  5.3  -  Display 
Constructions  Using 
Multiple  Tiles 
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5.4  A  Family  of  Related  Image  Acquisition 
Rates  and  Display  Refresh  Rates 

A  family  of  image  acquisiuon  and  display  refresh  rates 
should  be  based  on  a  progirssion  that  permits  non>tnter- 
polauve  transformations  between  the  acquisiuon  and  dis* 
pby  rates.  This  is  easily  implemented  if  the  acquisition 
and  display  rates  are  the  same,  or  if  the  dispby  refresh 
rate  is  an  integer  multiple  (or  fraction)  of  the  image 
acquisition  rate. 

Since  significant  archives  of  high  resolution  program 
material  exist  on  film,  which  was  acquired  at  24  or  25 
fps.  one  of  these  rates  should  be  included  in  the  progres* 
Sion.  A  progression  based  on  integer  multiples  of  12 
would  include  12. 24. 36, 48. 60. 72. 96. 120  Hz.  etc.  A 
progression  based  on  integer  multiples  of  12J  would 
include  12  J.  25. 50, 75, 100, 125  Hz.  etc.  These  progres* 
sions  might  also  include  integer  fractions  of  12  or  12J 
(e.g..  1/2  or  1/4  of  the  base  frame  rate  for  applications 
such  as  videoconferencing  and  searching  of  video 
databases.) 

It  has  been  common  practice  in  Europe  to  display  24  fps 
film  at  25  fps  for  compatibility  with  PAL  and  SECAM; 
this  results  in  a  4%  speed  increase.  Many  European  pro* 
grams  produced  for  television  disuibuuon  are  acquired  at 
25  fps:  if  the  family  of  rates  is  based  on  24  fps.  these 
programs  would  be  played  4%  slower.  As  indicated  in 
Section  4.8.  further  research  is  required  to  determine  the 
impxt  of  choosing  one  of  these  rates  on  those  industries 
that  utilize  film  for  image  acquisition. 

Ideally,  compatibility  with  existing  electronic  imaging 
systems  would  be  accommodated  in  the  design  of  the 
standard  modules  that  will  interface  these  systems  with 
the  digital  image  architecture.  By  design,  this  would 
place  the  burden  of  compatibility  on  the  systems  that  are 
being  replaced  rather  than  products  that  conform  to  the 
new  architecture:  thus  the  future  will  not  be  constrained 
by  today's  limiDtions. 

In  the  process  of  developing  the  existing  analog  and 
digital  high  resolution  teie%’ision  systems,  the  designers 


of  these  systems  have  demonstrated  the  practbality  of 
such  a  modular  approach  to  interoperabib'ty.  A  variety  of 
trenslation  devices  have  been  demonsoated  that  altow 
interoperation  between  PAL.  NTSC.  HD*MAC  and 
MUSE.  The  interface  modules  that  will  be  required  to 
transform  the  signals  from  these  systems  (especially 
NTSC  and  PAL)  into  the  new  architecture  offer  the  po¬ 
tential  for  large  volumes.  It  is  likely  that  the  market  for 
these  modules  will  be  characterized  by  intense  competi* 
tton.  leading  to  a  range  of  solutions  at  various  priced- 
formance  levels. 

In  the  near  tem  the  choice  ofa  family  of  rates,  based  on 
12  or  12J  Hz.  would  provide  optimally  low  cost  and 
high  performance,  for  both  advanced  television  and  com¬ 
puter  uses,  as  well  as  providing  global  interoperability.  In 
the  longer  term,  decoupling  of  acquisition,  transmissioo 
and  display  is  likely  to  lead  to  entirely  new  approaches  to 
pixel  replenishment  that  may  render  the  current  concept 
of  image  acquisition  rates  and  display  refresh  rates 
meaningless. 

5.5  Scalable  Coding  Algorithms 

Scalable  image  decomposidon  offers  the  ability  to  pro¬ 
duct  image  data  in  packages  that  can  be  combined  to 
produce  images  at  a  variety  of  spadal  and  temporal  rcso- 
luiions.  Decoding  and  displaying  the  bwest  frequency 
image  packets  would  produce  an  image  at  the  first  level 
of  the  hierarchy.  Addidonal  packets  (encoded  with  spo- 
tiaJ  and  temper^  differences)  would  be  decoded  to  pro¬ 
duce  images  at  higher  levels  of  the  hierarchy. 

This  approach  enables  extensibility.  For  example,  the 
coding  of  low  resolution  imagery  might  remain  on- 
changed  to  provide  compatibility  with  existing  decoders, 
while  new  coding  methods,  made  possible  by  the  geo¬ 
metric  progression  in  compuutional  hardware,  can  be 
introduced  to  support  more  advanced  imagery.  Increas¬ 
ingly  powerful  (and  affordable)  programmable  decoders 
can  provide  compatibility  with  the  standards  that  from 
the  foundation  of  the  digital  image  architecture,  and  the 
additional  processing  power  required  for  future  enhance¬ 
ments  to  the  architecture. 
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6  Industries  and  Applications  Considered 

6.1  Industries  and  Applications 

Industries  are  categorized  by  current  market  segmenu.  It 
is  important  to  keep  in  mind  that  convergences  among 
existing  industries  will  likely  occur  (e.g.,  computers  and 
consumer  electronics:  audio,  video,  and  datacomm),  and 
as  new  opportunities  to  provide  value  products  and  ser¬ 
vices  emerge,  entirely  new  industry  segments  will  un¬ 
doubtedly  come  forth. 

It  is  becoming  dilHcult  to  draw  the  line,  even  today, 
between  consumer  electronics  and  computers.  Today's 
video  game  machines,  already  in  millions  of  homes,  are 
marketed  as  consumer  accessories  to  televisions,  but  are 
in  fact,  more  compuutionally  competent  than  personal 
computers  of  only  a  few  years  ago.  Similarly,  personal 
computers  are  being  marketed  to  the  home  market 
through  traditional  consumer  electronics  channels. 

Traditional  business  factors  should  always  be  consid¬ 
ered.  These  include  equipment  replxement  costs,  amor¬ 
tization.  benefits,  competition,  market  needs,  and  access 
to  material. 

Successful  industry  panicipants  will  pay  close  attention 
to  emerging  trends  and  will  help  to  bring  them  about. 
Sometimes,  deep  pockets  may  be  required  to  create  a 
market.  (It  took  years  of  major  losses  in  both  equipment 
and  programming  efforts  before  color  television  became 
profitable.)  In  contrast,  agreement  on  a  common  archi- 
[  tecture  across  a  wide  range  of  indusuies  and  applications 
would  spread  the  costs  and  encourage  early  adoption. 

The  groupings  used  for  this  Report  help  to  rebie  applica¬ 
tion  requirements  to  industries.  It  is  well  understood  that 
there  is  already  much  overbp  between  industry  groups 
and  applications. 

The  industry  groupings  are  as  follows: 

•  Entertainment  Providers  (6.1.1) 

•  Distribution  and  Communications  (6.1.2) 

•  Commercial  Equipment  Manufacturers  (6.1  J) 

•  Consumer  Electronics  Manufacturers  (6. 1 .4) 

•  Computers  and  Information  6.  U) 

•  Education  (6.1.6) 

•  Engineering  and  Science  (6. 1 .7) 

•  Healthcare  (6.1.8) 

•  Military  and  Aeronautics  (6. 1 .9) 


6.1.1  Entenainment  Providers 

Entertainment  provider  Helds  include  programming,  ani¬ 
mation.  games  (personal  and  arcade),  broadcasting,  cine¬ 
matography,  post-production,  theatrical  presenution, 
and  pre-recorded  media. 

The  technologies  used  in  these  fields  are  highly  depen¬ 
dent  on  downsueam  profits.  It  can  be  difficult  to  justify 
large  investments  (e.gM  an  HDTV  production  facility)  in 
new  technologies  that  can  only  be  utilized  by  a  small 
portion  of  their  markeu  Smaller  investments  ths  require 
minimal  infrastructure  changes  (e.g.,  MTS  stereo,  VHS- 
HQ)  can  be  more  easily  justiHed.  parucularly  when  end- 
users  can  beneHt  with  existing  equipment  or  apid 
upgrade  is  anticipated.  Backward  compatibility  and  ex¬ 
tensibility  are  key  issues  here  and  can  only  be  success¬ 
fully  violated  when  there  are  substantive  tnmeHts  to  the 
end  user  (e.g.,  audio  compact  disc). 

Revenue  streams  can  often  be  anticipated  to  flow  well 
beyond  the  initial  release  of  the  producL  Residuals  from 
syndicaUon,  rentals,  and  sales  require  that  providers  an¬ 
ticipate  future  trends  in  end-user  viewing  equipment  ca¬ 
pabilities.  This  is  one  reason  why  most  prime  time 
television  is  shot  on  35mm  Him  and  not  video. 

6.1.2  Distribution  and  Communications 

The  distribution  and  communication  industries  that  wiU 
be  affected  by  digital  image  systems  include  telephone, 
television  bro^casting  and  cable  TV.  utilities,  videocon¬ 
ferencing,  electronic  mail  (including  text,  data,  image, 
animation,  video,  and  sound),  and  mobile  communica¬ 
tions.  Carrier  channels  that  will  pby  a  yet-to-be-deter- 
mined  role  in  this  process  include  optical  Hbeis.  broad 
and  spot-beam  satellites,  microwave,  cellular,  conveit- 
tional  VHF/UKF  terrestrial  broadcast,  broadband 
cable,  and  local  and  wide  area  networks.  Also  impacted 
will  be  video  ope.  video  disc.  game,  and  general  soft¬ 
ware  distribution. 

There  is  some  effort  to  establish  a  video  dialtone  similar, 
in  concept,  to  today's  voice  telephone  dialtone.  As  com¬ 
munication  networics  increase  bandwidth,  and  compres¬ 
sion  technologies  improve,  an  increased  use  of  remote 
real-time  visual  communications  can  be  expected. 

These  same  advancements  also  facilitate  rapid 
downloading  of  video  information  from  media  servers:  at 
a  100:1  compression  ratio,  the  data  for  a  typical  motion 
picture  could  be  transmitted  in  a  few  minutes  over  a  full 
video-bandwidth  capable  network. 

Because  of  the  universal  proliferation  and  converston 
standards  for  the  telephone,  it  is  likely  that  we  will  soon 
see  extensions  of  current  fax  standard  including:  voice 
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fax  (voice  mail),  high  resolution  color  image  fax.  and 
video  fax  (video  mail).  One  of  the  driving  forces  behind 
the  development  of  the  JPEG  image  compression  sun* 
dard  vvas  the  need  for  an  efficient  dau  r^uction  tech* 
nique  for  the  transmission  of  still  images. 

The  telecommunications  industry  is  well  down  the  road 
in  the  establishment  of  digital  imaging  standards.  The 
CCm.  which  controls  fax  standards,  worked  with  the 
IEEE  on  the  JPEG  standard  and  on  the  videoconferenc¬ 
ing  standard  known  as  P.64  or  H.261.  These  groups  are 
also  lesponsible  for  the  MPEG  family  of  moving  picture 
standards.  JPEG  and  MPEG  I  and  P.^  form  the  basis  for 
the  first  generation  of  image  telecommunications  prod¬ 
ucts  that  are  already  starting  to  reach  the  market. 

These  standards  were  designed  with  a  high  degree  of 
flexibility  to  deal  with  a  variety  of  imaging  applications: 
they  have  served  as  excellent  examples  for  the  Task 
Force  in  the  area  of  interoperability  and  scalability.  Cur¬ 
rently  the  MPEG  group  is  working  on  extensibility; 
MPEG  n  is  targeted  for  the  delivery  of  higher  quality 
moving  image  dau  streams  in  the  range  from  two  to  forty 
megabits  per  second.  The  MPEG  working  group  is  inves¬ 
tigating  scabbility  as  a  requirement  for  this  extension  of 
MPEG.  It  would  be  beneficial  for  these  new  standards  to 
relate  harmoniously  to  other  digital  imaging  architec¬ 
tures. 

The  merging  of  both  broadcast  and  interactive  voice, 
image  (including  graphics  and  video),  text,  and  data 
across  diverse  transport  media  will  create  challenges  in 
properly  matching  the  information  with  the  delivery 
mechanism.  Current  efforts  to  implement  interactive 
television,  for  example,  use  differing  transmission  media 
for  each  direction  (e.g..  broadband  in;  telephone  or  cellu¬ 
lar  radio  out). 

Factors  such  as  existing  infrastructure,  projected  time 
and  cost  to  deploy,  bandwidth  cost,  regulatory  issues, 
nature  of  the  signal,  target  viewer,  compression,  error 
sources.  localization,  security,  btency.  etc.,  need  to  be 
considered. 

The  communications  infrastructure  deployed  for  the  en- 
teruinment  market  could  provide  a  profound  leverage 
for  the  information  domain.  For  example,  a  broad  con¬ 
sumer  demand  for  access  to  high  bandwidth  entertain¬ 
ment  (and  other)  services  could  accelerate  the  national 
instalbtion  of  fiber-optic  cables.  Once  in  place,  these 
high  bandwidth  networks  could  abo  be  used  as  high 
performance  links  to  super-computers  and  very  large 
data  bases,  and  broadly  distribute  real-time  business,  en- 
gmeering,  and  scientific  data. 

While  insullation  of  fiber-optic  cable  to  a  najor  user 
base  can  take  many  years,  new  or  existing  satellites  can 
cover  huge  population  areas  very  quickly.  A  variation  of 
direct  broadcast  satellite  (DBS)  transmission  is  spot- 
beam  satellite  technology.  In  this  approach,  as  few  as 


three  satellites  could  be  used  to  provide  localized  high 
quality  (HDTV)  signals  to  small  inexpensive  receiving 
devices  in  as  many  as  150  geographic  areas  within  a 
country  the  size  of  the  continental  United  States. 

6.1.3  Professional  Equipment  Manufacturers 

Equipment  manufacturers  who  produce  studio,  produc¬ 
tion.  storage  and  distribution,  and  test  and  measurement 
equipment  will  enjoy  opportunities  to  provide  their  cus¬ 
tomers  with  new  products  and  services  that  can  be  useful 
across  a  range  of  industries.  Digital,  extensible,  scalable 
image  architectures  can  provide  high  value  per  dollar  and 
increased  economies  of  scale. 

The  computer,  medical,  and  ^phics  industries  could 
similarly  benefit  from  harmonious  formats  that  would 
allow  them  to  produce  image  generating,  manipulating, 
managing,  storing,  and  viewing  applications  and  devbes 
at  reduced  cost  and  increased  interoperability. 

Some  specific  indusuial  application  areas  include  secu¬ 
rity  equipment  for  surveillance  and  identification,  and 
pr^uct  and  process  inspection. 

6.1.4  Consumer  Electronics  Manufacturers 

The  introduction  of  digital  technologies  into  consumer 
products  opens  the  way  to  new  and  improved  services 
and  capabilities.  As  the  consumer  market  increasingly 
demands  higher  image  quality  for  both  work  at  home 
(e.g..  personal  computers)  and  entertainment  (e.g..  televi¬ 
sion  and  video  games),  there  will  continue  to  be  incen¬ 
tives  to  push  the  technologies  that  will  bring  a  better 
picture  to  the  consumer. 

This  will  create  opportunities  in  the  receiving  devices, 
the  electronic  components  that  go  into  them  (e.g.,  semi¬ 
conductors.  light  sources  and  modulators)  and  the  sub¬ 
systems  (e.g..  displays,  tuners,  and  signal  processors). 
The  likely  emergence  of  new  product  categories  can  both 
heighten  and  personalize  the  entertainment  experience. 

Ancillary  devices  (e.g..  tape  and  disc  recorder^layers. 
camcorders,  editing,  processing,  sound  systems,  primers, 
scanners,  interactive  peripherals)  will  be  additional 
sources  of  added  value  products. 

It  is  likely  that  computer  control  technologies  will  play 
an  ever  increasing  role  in  home  emenainment  and  infor¬ 
mation  systems.  The  integration  of  all  of  the  equipment 
listed  in  the  preceding  paragraph  in  the  home  entertain¬ 
ment  environment  has  proven  to  be  a  major  problem  - 
and  a  significant  opportunity.  We  have  seen  programma¬ 
ble  remote  0000*0!  devices  evolve  to  replace  the  profu¬ 
sion  of  separate  infrared  conuollers  (TV  tuner,  cable 
tuner.  VCR.  laserdisc,  audio  CD.  radio  tuner,  etc.).  The 
integration  of  the  graphical  user  interface  from  the  world 
of  desktop  computing  with  the  home  entenainment/in- 
formation  system  ha.s  begun. 
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Collaborative  cross*industry  efforts  will  merge  comput* 
ers  into  home  entertainment  networks,  dealing  with  the 
issues  of  component  integration,  connection  to  multiple 
sources  of  enteiuinment  and  information,  user  interibce. 
and  “user  friendly*  programming  of  the  system.  Various 
flavors  of  “personal  computers*  in  the  home  will  be  able 
to  connect  to  this  network  as  well  as  intelligent  appli* 
ances  and  remote  control  devices.  Inexpensive  network* 
able  cameras  will  allow  remote  visual  monitoring;  the 
firont  door,  the  baby’s  room,  etc. 

6.1J  G>mputers  and  Informadon 

Human  vision  provides  the  highest  bandwidth  informa¬ 
tion  inter£ice  to  the  machine  world.  Computer  technol¬ 
ogy  can  serve  as  an  effective  enabling  tool  for  image 
informadon  creaUon,  capture,  processing,  storage  and 
archiving,  access,  transmission,  and  presenadon.  While 
computer  assisted  information  in  the  1980$  largely  fo¬ 
cused  on  text.  data,  and  simple  graphics,  rapid  chtmges 
are  taldng  place  to  support  other  media  (audio,  image, 
animation,  video,  simuladons,  etc.).  This  places  in¬ 
creased  demands  on  computer  performance  and  human 
interface  to  handle  the  significantly  higher  data  content 
in  these  media. 

To  provide  specific  types  of  information  to  users,  new 
classes  of  specially  tuned  information  appliances  will 
likely  develop.  TheM  appliances  will  rely  on  information 
providers  to  collect,  generate,  and  organize  information. 
In  the  education  market,  for  example,  an  information 
appliance  might  be  tuned  toward  providing  everything  a 
student  needs  to  progress  through  a  particular  ci^.  Be¬ 
sides  basic  course  content,  texts,  lecture  notes,  assign¬ 
ments.  etc.,  it  could  make  extensive  use  of  imagery  to 
provide  interactive  multimedia  tutorials,  remedial  help, 
lab  simubtions.  extensive  reference  material,  electronic 
messaging,  and  sman  links  to  classmates. 

In  the  information  age.  a  critical  challenge  is  the  produc¬ 
tive  management  of  the  overwhelming  amount  of  infor¬ 
mation  produced  each  year.  Unfonunately.  images  and 
video  tend  to  make  this  problem  even  greater.  While 
daubase  search  engines  deal  reasonably  well  with  key¬ 
word  searches  and  invened  indexes  on  textual  data,  cor¬ 
responding  tools  for  other  media  have  tremendous 
opponunities  for  improvement. 

Museums  and  libraries  could  use  electronic  file  systems 
to  caulog  and  view  very  high  resolution  images  of  the 
masters.  Sculptures  and  other  three  dimensional  objects 
could  be  shown  on  stereographic  or  holographic  dis- 
pbys.  or  printed  on  very  high  quality  large  format  print¬ 
ers. 

’The  role  of  the  artist  and  graphics  designer  has  changed 
dramatically  as  the  quality  and  flexibility  of  the  “elec- 
ironic  canvas*  has  come  to  emulate  the  various  forms  of 
traditional  media.  Just  as  the  camcorder  has  allowed 
many  budding  cinematographers  to  explore  their  an. 


high  resolution  drawing  tools  with  interactive  training 
are  revolutionizing  electronic  publishing  and  winning 
over  graphic  artists.  Many  artists  are  expanding  into  new 
markets  such  as  videographics  and  animation  from  ihii 
electronic  base. 

Traditional  forms  of  printing  and  publishing  infoimaiion 
delivery  wiU  continue  to  exist  alongside  newer  mediums. 
Electronic  billboards  could  change  messages  by  day  of  i 
week  or  time  of  day.  Electronic  books,  magazines,  CM- 
Jogs,  and  advertisements  can  integrate  interactive  video 
and  other  media  to  tell  a  story,  make  a  point,  or  sell  a 
product  They  can  also  elicit  information  from  the  oaer 
that  can  provide  useful  information  to  the  publisher  (e4-. 
“hard  to  understand  this  cmcept*  “would  like  product  in 
green"). 

6.1.6  Education 

One  strategy  for  promoting  the  use  of  digital  image  tedh 
oologies  in  education  is  to  leverage  high  volume  coih 
sumer  products.  There  is  now  a  real  opportunity  ID 
leverage  scabble,  interoperable,  extensible  consumer 
products  into  the  classroom  and  other  learning  envfrw- 
ments  (e.g..  lab,  home,  library,  tutoring,  group  study). 

Insdiutional  training  represents  the  high  end  of  the  eihi- 
caiional  market.  An  economic  return  on  investment  can 
often  justi^'  the  use  of  expensive  technology  to  maxi¬ 
mize  training  “productivity*  since  the  employee  students 
are  being  paid  wages  while  not  working.  Increased  use  of 
sophisticated  interactive  multimedb  tools  developed  and 
us^  in  these  environments  could  find  derivative  use  in 
public  classrooms  and  the  home. 

6.1.7  Engineering  and  Science 

Engineers  and  scientists  have  traditionally  used  the  hi|^ 
end  of  graphics  and  imaging  systems  for  data  visualiza¬ 
tion,  design,  simubtion  and  scientific  visualization.  This 
will  likely  continue  as  new  uses  expand  into  such  areas 
as  microscopy  and  astronomy. 

This  community  has  often  utilized  high-end  verrions  of 
consumer  technologies  (e.g.,  TV  CRT/Workstatioa 
CRT).  Their  role  in  leading  versus  leveraging  the  next 
generation  of  imaging  systems  is  not  clear.  The  exisience 
of  a  proper  digital  image  architecture  will  reduce  barriere 
across  applications,  pbtfotms.  and  markets. 

6.1.8  Healthcare 

Healthcare  represents  a  growing  cost  concern  for  most 
industrialized  societies.  While  a  digiol  image  architec¬ 
ture  may  not  directly  reduce  costs,  the  judicious  use  of 
images  and  video  can  provide  an  improved  cost/beneflt 
ratio  for  physician  training,  medical  research,  and  gen¬ 
eral  patient  care. 
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High  Tesolution  imaging  can  be  useful  in  radiology,  mi¬ 
croscopy,  patient  monitoring  (especially  during  surgery), 
and  consuitau'on  with  specialists  in  a  remote  location. 

Image  requirements  can  be  very  stringent  Doctors  some¬ 
times  use  a  magnifying  glass  to  look  for  subtle  changes 
in  gray  level  on  on  X-ray.  Image  fidelity  is  critical. ' 

Training  simubtors.  perhaps  utilizing  vinual  reality  tech¬ 
niques.  can  provide  medical  students  with  improved  en¬ 
vironments  for  learning  over  classroom  and  cadaver 
procedures. 

Although  the  spatial  resolutions  and  signal  integrity  re¬ 
quirements  may  exceed  many  other  applications,  the 
healthcare  community  would  like  to  benefit  from  harmo¬ 
nization  with  other  digital  image  architectures. 

6.1.9  Military  and  Aerospace 

Tradidonally,  the  military  and  aerospace  industries  have 
driven  the  high  end  of  the  imaging  market  with  severe 
nission-aitical  requirements.  While  in  the  past  cost  was 
a  concern  second  to  functionality,  new  economics  dictate 
that  more  effective  leverage  be  made  of  existing  stan¬ 
dards.  technologies,  and  products  wherever  possible. 

Typical  applications  include  radar  and  other  tracking, 
surveillance,  flight  simulation,  general  training  simula¬ 
tors.  mission/sicuadon  control  rooms,  instrument  control 
panels  in  aircraft/vehicles.  satellite  inuging.  virtual  real¬ 
ity.  telepresence,  and  cartography. 

Increasing  emphasis  is  being  pbced  on  dual  use  technol¬ 
ogies.  The  community  learning  network  is  one  example 
of  using  advanced  imaging  technologies  for  both  govern¬ 
ment  and  civilian  education. 

6.2  Application  Requirements 

Throughout  the  digital  image  path,  there  are  specific  and 
interrelated  requirements  that  should  be  understood  for 
any  particular  application  or  family  of  applicadons.  A 
standards  architect  or  applicauon  developer  should  be 
aware  of  not  only  current  needs  and  projected  needs,  but 
also  past  infrastruemres.  potentblly  rebted  applicauon 
areas,  and  long  term  technology  uends. 

6.2.1  Latency 

For  any  ^pliadon.  tolerance  to  transmission  path  la¬ 
tency  ^ould  be  considered.  Small  latency  times  can 
impose  increased  compression  and  transport  costs.  Some 
examples  ofxcepable  biencies  follow: 

<  0.1  second  (interactive  flight  simulation,  vinual 
reality,  tebprescnce.  video  games,  cursor  control) 

<  02  seconds  (teleconferencing,  video  phone) 

<  1  second  (gambling  events,  market  dam) 


<  1  minute  (live  events,  news  events,  announcements, 
dau  files) 

<  1  hour  (weather,  general  news,  images) 

<  1  day  (newspapers,  recorded  programs,  phone  list¬ 
ings) 

<  I  week  (catalogs,  general  reference  material,  maps, 
business  directory  pages) 

6.2.2  Synchronization  with  Other  Media 

Traditionally,  motion  pictures  and  video  have  been  con¬ 
cerned  primarily  with  synchronization  between  image 
and  sound  oacb  Future  imaging  architectures  should 
consider  synchronization  requirements  with  other  media 
and  genei^  control  inputs  and  outputs.  Examples  of 
other  medb  used  for  special  ^plications  might  involve 
any  of  the  other  three  human  senses  (touch,  smell,  taste), 
extensions  of  the  first  two  (sight  and  sound),  as  well  as 
physiological  inputs  (e.g..EkC,££G,£MG,  tespiiaiion. 
perspiration,  salivation,  biochemical  levels). 

Both  input  (t.e.,  response  time)  and  output  synchroniza¬ 
tion  should  be  considered.  Accepuble  synchronization 
can  vary  with  image  content.  For  example,  voice  should 
have  excellent  synchronization  when  an  actor’s  lips  are 
seen,  but  less  synchronization  is  needed  when  the  actor  is 
off  camera.  Background  music  can  accommodate  even 
less  synchronization  as  long  as  it  is  not  keyed  to  action  or 
scene  transitions. 

Motion  inputs  (e.g..  physical  controls,  gestures,  head  or 
eye  trxking,  facial  expressions),  and  outputs  (vibration, 
g-forces.  wind)  can  also  have  varying  needs  for  synchro¬ 
nization. 

6.2.3  The  Digital  Image  Path 

Images  flow  through  (and  may  be  stored  in)  the  follow¬ 
ing  five  processes: 

•  Capture/acquisition/creation 

•  Processing 

•  Transport 

•  Reconsffuction 

•  Presentation 

Each  of  these  represents  a  range  of  opponuniiies  for 
industries  and  applications. 

Image  capture/acquisition/creation  includes: 

•  Live  image  capture  (video  cameras  and  film  cameras) 

•  Previously  recorded  motion  images  (telecine) 

•  Previously  recorded  still  images  (fax,  scanners) 
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•  AnificiaJ  images  created  on  gnphics  computer  work¬ 
stations  and  animation  computers 

Factors  aTTccting  image  capture  include  avaikible  light, 
subject  motion,  spatial  resolution,  colorimetry,  and  cost. 

Good  light  sensitivity  is  an  important  factor  in  available- 
light  location  shots.  In  studio  productions,  sensitive  cam¬ 
eras  can  reduce  equipment  and  electrical  requirements 
for  lighting  and  resultant  air  conditioning. 

In  scenes  with  high  subject  motion  like  sporting  events, 
an  image  sensor  configured  with  quick  response  to  mo¬ 
tion  is  imponanL  A  fast  scan  rate  is  the  overriding  factor 
here,  panicularly  when  minimal  single  frame  blur  is  im¬ 
portant  (for  slow  motion  or  single  frame  playback).  Cur¬ 
rent  technology  favors  interlaced  image  sensors  for  this 
type  of  application,  however  post-processing  or  future 
technical  advances  in  image  sensors  can  be  applied  to 
eliminate  interlxe  anifacis  before  the  signal  goes  very 
Car  down  the  image  path. 

The  scan  rate  of  an  image  sensor  should  also  rebte  com¬ 
patibly  to  the  frame  rate  of  its  source  material  (e.g.. 
movie  film),  and/or  the  anticipated  frame  rate  of  the 
viewing  device. 

Spatial  resolution  can  be  expected  to  improve  for  both 
still  and  motion  image  sensors.  As  described  elsewhere 
in  this  Report,  square  grids  and  properly  scalable  array 
geometries  are  important  factors  in  providing  extensibil¬ 
ity. 

Specific  applications  can  require  high  spatial  but  lower 
temporal  resolutions.  Image  scanners  fall  into  this  cate¬ 
gory.  Used  for  medical  X-rays,  hard  copy  scanning.  Him 
conversion,  and  fax.  a  common  characteristic  is  the  need 
for  high  image  integrity  (e.g..  error-free  image  sensors, 
lossless  compression,  minimal  geometric  disionion.  ro¬ 
bust  error  correction). 

An  ideal  image  sensor  would  be  able  to  resolve  the  entire 
range  of  color  tints  and  hues  visible  to  a  human  eye  over 
a  very  wide  dynamic  range.  It  would  also  have  well 
defined  electrical  transfer  characteristics.  Falling  shon  of 
this,  it  is  important  that  the  colorimetric  transfer  charac¬ 
teristics  be  sufficiently  defined  to  accommodate  faithful 
propagation  throughout  the  image  path. 

Future  image  sensors  will  likely  contain  increasing 
amounts  of  on-device  signal  processing  in  the  form  of 
motion  detection,  compression,  and  error  detenion  and 
correction.  They  may  not  be  scanned,  but  interrupt 
driven,  responding  to  changes  in  the  image.  Devices  may 
even  begin  to  take  on  some  functional  characteristics  of 
the  human  retina. 

Processing  includes: 

•  Format  conversion 

•  On-line  and  off-line  editing 


•  Color  and  density  control 

•  Compositing 

•  Image  and  sound  enhancement  and  synchronization 

•  Special  effects  (blowups,  mirror  imaging,  zooms, 
pans,  superimposition,  morphing,  tilling,  speed 
change,  freeze  frames,  dissolves,  wipes,  paint  box. 
etc.) 

•  Siorageandretrieval(cllps.scenes.shows.stockfoot- 
age.  libraries) 

Transport  includes: 

•  Compression 

•  Encoding 

•  Transmission 

•  Movement  through  air,  space,  fiber,  wire,  or  high¬ 
ways 

•  Storage 

•  Reception 

•  Selection 

Reconstruction  includes: 

•  Decoding 

•  Error  correction  and  concealment 

•  Decompression 

•  Synchronization 

•  Image  enhancement 

A  receiving  device  requires  an  image  processing  engine 
to  properly  reconstruct  information  from  the  signal  This 
information  must  be  compatible  with  both  the  presenta¬ 
tion  device  and  local  storage  (e.g..  ope.  disc,  semicon¬ 
ductor). 

Presentation  includes: 

•  Compositing 

•  Overlaying 

•  Buffering 

•  Storing 

•  Manipulating 

•  Displaying 

Presenotion  manipubtion  can  be  spatial  (e.g~  sootn. 
pan.  detailing,  colorization)  or  temporal  (slow  motion, 
still  frame,  fast  scan).  Many  of  these  manipubtions  may 
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be  difficuli  10  achieve  with  compression  schemes  iha(  use 
incremenial  innsmission  or  sub-sampling  techniques. 

Some  applications  require  bi-directional  capabilities. 
Some  examples  are;  interactive  communications,  on-dc- 
mand  programming,  pay-per-view,  and  client/server 
models. 

In  a  client/server  structure,  the  presentation  “client"  de¬ 
vice  may  be  physically  separate  from  the  rcception/re- 
construcu'on  “server."  This  model  might  apply  to  both 
robust  and  upgradable  servers  in  a  home  neighborhood 
or  on  a  computer  network  within  an  engineering  ofllcc 
environment.  In  either  case,  the  server  would  need  to  be 
able  to  interrogate  the  client  so  that  it  could  properly 
reconstruct  the  presentation  information. 

Displays 

6.3.1  Genera]  Considerations 

No  other  component  of  a  digital  image  system  has  more 
impact  on  industries  and  applications  than  the  display. 
More  than  bandwidth  limiiniions.  image  capture,  and 
signal  processing,  the  performance  and  economic  con¬ 
straints  of  available  displays  are  currently  the  greatest 
pacing'factors. 

Applications  that  can  live  within  current  display  con¬ 
straints.  or  rapidly  utilize  or  promote  advancements  in 
both  flat  screen  and  projection  (and  to  a  lesser  degree, 
direct  view  CRTs)  wiU  be  in  the  best  positions  to  prosper. 

PotenuaJ  display  image  sizes  can  range  from  a  wrist 
watch  to  a  planetarium.  General  factors  that  should  be 
considered  in  specifying  a  display  include:  number  of 
viewers,  viewing  conditions,  spatial  and  temporal  resolu¬ 
tions.  pixel  size  and  shape,  lithographed  versus  variable 
picture  elements,  refresh  rate,  brightness,  density,  color 
gamut,  micro  defects,  aging,  reliability,  aliasing,  ani- 
(acts,  aspect  ratio,  overall  display  image  area,  display 
package  size,  power  requirements,  and  cost.  Some  of 
these  factors,  not  already  discussed,  will  be  expanded  on. 

There  are  two  general  categories  of  viewing  environ¬ 
ments;  single  viewer  and  multiple  viewer.  Traditionally, 
single  viewer  display  sizes  have  been  smaller  (<  17 
inches)  and  the  applications  have  been  more  “task"  ori¬ 
ented  and  interactive  (e.g..  computer  display).  Multiple 
viewer  displays  have  l^n  brger  (>  19  inches)  and  been 
more  “entertainment"  oriented  (e.g..  TV)  and  passive. 

Both  of  these  traditions  are  changing  and  will  cotitinue  to 
do  so.  In  particular,  a  proliferation  of  single  viewer  enter¬ 
tainment  displays  (e.g..  personal  TVs  and  games),  and 
multiple  viewer  task  displays  (e.g..  electronic  white 
boards  and  overhead  projection  panels)  will  be  fueled  by 
continuing  advances  in  dlspLny  technologies. 

Viewing  angle  is  an  imponant  factor  in  tuning  displays  to 
applications.  The  viewing  angle  is  a  function  of  display 


size  and  viewer  distance.  For  a  constant  spatial  resolution 
at  the  viewer's  retina,  overall  display  resolution  needs  to 
increase  as  viewing  distance  decreases. 

As  screen  sizes  increase  and  images  get  brighter,  flicker 
becomes  more  of  an  issue  in  scanned  dispbys.  A  72  Hz 
scan  rate  can  produce  noticeable  dicker  with  younger 
viewers  in  some  situations.  At  a  50  or  60  Hz  display  scan 
rate,  screens  with  high  brightness  that  cover  a  wide  field 
of  view  can  produce  objectionable  levels  of  flicker. 

It  is  imponant  to  separate  flicker  produced  from  scan¬ 
ning  a  display  (commonly  a  flying-spot  on  a  CRT),  from 
Other  causes  (e.g..  capture,  conversion,  interbee.  signal 
processing  artifacts). 

Head  mounted  viewing  devices  (glasses  or  goggles) 
could  make  single  user,  low  cost,  high  tesolution  dis¬ 
pbys  practical.  Additionally,  a  viewing  device  with  dual 
displays  (versus  a  mirror  arrangement)  would  have  the 
inherent  ability  to  dispby  stereoscopic  images.  This  type 
of  device  could  have  both  task  and  entcruinment  appli¬ 
cations.  would  have  a  size  and  privacy  advanbge  for 
ponable  applications  (e.g..  ponable  computing,  viewing 
proprietary  videos  on  airpbnes).  and  operate  well  in  poor 
ambient  lighting  situations.  It  could  also  be  the  lowest 
cost  way  to  deliver  high  resolution  images  to  the  early 
consumer  market. 

Future  displays  might  also  provide  stereoscopic  images 
without  special  viewing  glasses  and  virtual  holography 
(stereoscopic  images  with  multiple  viewing  perspec¬ 
tives).  Image  architectures  would  need  to  pay  particular 
attention  to  accommodating  the  biter. 

As  the  market  demands  increasingly  improved  display 
capabilities,  entirely  new  technologies  and  display  struc¬ 
tures  may  come  into  being.  Some  features  which  could 
find  their  way  into  future  displays  include:  directly  ad¬ 
dressable  image  elements,  byered  structures  with  control 
over  picture  element  persistence,  variable  spatial  and 
temporal  resolutions  across  the  surfxe.  on-dispby  scene 
creation  and  manipulation,  fixed  eye  position  displays 
that  map  resolutions  to  match  human  retinas,  eye  track¬ 
ing  displays  that  tune  resolutions  xross  the  surface  to 
produce  an  optimal  image  to  the  viewer. 


6.3.2  Practical  Limits 

A  digital  image  architecture  that  is  extensible  and  inter¬ 
operable  can  be  expected  to  improve  in  quality  over  the 
years.  There  arc  some  practical  limits  beyond  which 
human  visual  perception  can  be  exceeded,  making  fur¬ 
ther  gains  in  hardware  capabilities  nonproductive. 

Although  current  sound  irchnology  parameters  are  close 
to  or  beyond  human  audible  capabilities,  there  is  a  vast 
chasm  of  opportunities  to  be  iillcd  before  we  approach 
our  optical  limits.  There  arc  more  mundane  limitations  as 
well. 
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For  example;  while  one  might  imagine  a  20  meter  diago¬ 
nal  display  with  0.1  mm  pixel  pitch,  there  .vc  practical 
limits  to  both  physical  display  size  (how  many  home 
living/enienainmcnt  rooms  could  support  such  a  large 
screen  . 0  and  human  capabilities  (one  would  need  a  mag¬ 
nifying  gloss  or  binoculars  to  appreciate  such  spatial 
resolution).  On  the  other  hand,  close  examination  of  im¬ 
ages  from  the  old  masters  might  justify  just  such  a  dis¬ 
play.  And  topological  images  might  even  require  that  this 
physically  limited  display  be  manipubicd  to  bring  in 
additional  portions  or  viewpoints  of  the  larger  source 
imageCs). 

6.3.3  Future  Receiver/Display  Possibilities 

During  the  next  twenty  years,  some  receiver/dispby  op¬ 
tions  that  should  be  considered  include: 

Wrist  watch  display 

single  viewer,  close  viewing 
broad  range  of  ambient  light  levels 
low  pixel  count 
low  power 

wireless  signal  reception 

plug-in  capsule  with  pre-recorded  programming 

low  to  modest  signal  processing 

Personal  viewing  device 

eyeglass  "heads-up"  display 

resolution  rrutching  the  human  retina 

stereoscopic  image  capable 

head  tracking  capable  (signal  bandwidth  permitting) 

Home  entertainment  display  (classical  HDTV) 

large  size  (two  meters  diagon:il) 

viewing  at  moderate  distances 

medium  to  high  ambient  light  levels 

projection  technology  transitioning  towards  Hat  panel 

Physiebn's  work  surface  (simubtes  X-ray  light  wall) 

brge  size  (two  meters  dbgonal) 
close  viewing/fine  pitch 
optional  touch  screen  input 
need  to  magnify  small  portions  of  multiple  images 
composite  of  mostly  still  monochrome  images 
large  dynamic  range  of  gray  scale 
need  to  have  shared  dlspby  regions  with  other  physi- 
ebns 

can  dispby  other  information 

Engineer's  white  board 

large  size  (two  meters  dbgonal) 
close  viewing/fine  pitch 
touch  screen  input 

composite  of  multiple  image  windows 
varbbic  spatial  and  temporal  resolution  over  screen 
sections 


Drafting  table 

brge  size  (1.3  meters  diagonal) 
close  viewing/fine  pitch 
pcnAouch  input 
monochrome  acceptable 
low  temporal  resolution 

Writer's  work  uble 

moderate  size  (one  meter  dbgonal) 
presents  multiple  “pages"  of  documents 
pen/touch  bput 

affords  easy  cut  and  paste  (reducing  paper  drafts) 
text  oriented,  with  some  image  capability 
close  viewbg/Ttrte  to  moderate  pitch 
monochrome  accepmble 
low  temporal  resolution 

Artist's  canvas 

moderate  size  (one  meter  dbgonal) 
close  viewbg/nne  pitch 

excellent  spatbl  features  (resolution,  contrast,  color 
gamut) 

low  temporal  resolution  (unless  animation  is  requited) 
specialized  input/control/editing  devices 

Make-up  mirror 

range  of  screen  sizes  and  aspect  ratios 
“through-the-screen"  camera  capability 
highly  bteractive  image  processing 
high  temporal,  spatial,  color  resolution 
color  keying  to  cosmetics  and  lighting 

Augmented  imagery  (ancillary  to  main  viewing  suifxe) 

manipulate  ambient  room  lighting  (lighming.  color) 
manipulate  moving  room  lights  for  peripheral  vision 
effects 

physically  animated  room  objects 
multiple  vbwing  screens 

6.4  Toward  the  AAAA  (Anything,  Anytime, 
Anyplace  Appliance) 

At  the  extreme  in  communbations.  imagine  having  a 
global  archive,  switching,  communication,  and  receiving 
infrastructure  b  place  that  would  let  an  mdividual  access 
information  in  any  media  (Library  of  Congress, 
Smithsonbn.  encyclopedias,  technical/^fessional  jour¬ 
nals.  movies  and  TV  programs  from  all  film  libraries,  an 
from  all  major  museums,  company  daabases.  home  vid¬ 
eos.  personal  medical  records,  new  car  bets  with  pic¬ 
tures  and  videos,  weather  maps,  bank  accounts,  sporu 
scores  with  selected  repbys.  visual  stock  performance, 
restaurant  menus  complete  with  aromas  and  a  vbw  from 
your  table,  etc.)  plus  real-time  access  to  any  individual  or 
;roup  of  people  (voice,  video,  shared  display,  etc.) 
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7  Future  Work  and  Other  Issues 


This  Repon  is  in  essence  a  progress  report  on  the  require¬ 
ments  for  establishing  digital  imaging  standards  that  are 
interoperable,  scalable  and  extensible.  In  many  cases 
addiuonal  research  and  further  discussions  will  be  re¬ 
quired  to  resolve  the  remaining  issues.  This  section  iden- 
lines  some  of  those  issues  that  need  to  be  resolved.  These 
provocative  issues  will  benefit  from  discussion  in  the 
SMPTE  engineering  community  at  large.  The  SMPTE 
Standards  Committee  will  dciemine  engineering  com- 
miuee  assignments  to  undertake  a  more  detailed  evalua¬ 
tion  of  these  elements. 

Gating  Technologies — 

A  gating  technology  sets  the  pace  for  advancements  in 
technology  products  and  systems.  For  most  of  the  history 
of  television,  the  display  was  the  fundamenul  gating 
technology.  Only  in  recent  years,  has  this  role  shifted  to 
the  transmission  standard  itself. 

It  should  not  be  assumed  that  future  architectures  will  be 
gated  by  display  technologies  over  the  long  term.  Other 
elements  in  the  image  path  should  be  carefully  evaluated 
as  to  their  potential  impact  as  gating  technologies. 

Strawman  Proposals  — 

For  a  robust  digital  architecture  standard  to  successfully 
cross  industries,  applications,  and  time,  it  is  critical  that 
thorough  simulation  and  testing  be  performed  across  a 
range  of  applications. 

At  least  three  diverse  suawman  applications  should  be 
selected  as  test  vehicles.  Interoperability  between  these 
should  be  verified.  Candidate  applications  include: 
broadcast  television,  multi-media  computer  work¬ 
stations.  medical  X-ray.  vinual  reality,  (light  simulation, 
video  phone,  scientific  visualization,  and  client-server 
networks. 

Long  Term  Extensibility  — • 

An  accurate  forecast  of  technologies  and  applications  for 
the  next  fifty  years  is  unlikely.  However,  a  diligent  eval¬ 
uation  of  potentially  relevant  work  underway  in  research 
bboratorics  throughout  the  world,  and  a  careful  study  of 
anthropology  and  market  demographics,  should  help  in 
achieving  tong  term  extensibility. 

Specifically,  breakthroughs  in  image  capture,  display, 
communications,  storage,  and  signal  processing  technol¬ 
ogies  could  all  have  a  profound  effect  on  future  image 
based  applications. 

Frame  Rate  Evalu.ntion  <— 

There  is  a  need  to  gather  existing  data  and  perform  ex¬ 
periments  to  determine  desirable  frame  rates  fur  a  range 
of  services  including  sports  and  other  high  temporal 
events.  Bad  numbers  for  conversion  to  or  from  existing 
standards  should  be  explored  so  that  they  can  be  avoided. 


Other  Issues  — 

The  questions  which  follow  should  be  considered  in  the 
future  work  leading  to  a  digital  image  architecture.  Tltey 
cover  a  wide  range  of  topics  and  considerations.  Some 
may  ultimately  prove  to  be  important,  others  may  be¬ 
come  ittsigniHcant  due  to  advances  in  technology. 

As  discussed  in  Section  2,  the  concept  of  scalability 
refers  to  the  ability  to  extraa  higher  and  tower  quality 
results  from  a  common  signal  format  The  concept  of 
extensibility  indicates  the  need  to  xcommodate  future 
enhancements  in  systems  due  to  the  rapid  pace  of  tech¬ 
nology. 

Scalability  and  Extensibility  require  that  many  of  the 
following  areas  have  mechanisms  for  increasing  and 
decreasing: 

•  Image  resolution  (7.1) 

•  Image  temporal  rate  02) 

•  Image  byers,  overbys.  and  windows  02) 

•  Compression  quality  level  (7.4) 

.  Data  rate,  in  rebtionship  to  image  quality  OS) 

•  Image  luminance  dynamic  range  (7.6) 

•  Image  colorimetric  range  (7.7) 

-  Inuge.  number  of  active  channels  (7.8) 

•  Audio  quality  (7.9) 

•  Audio,  number  of  channels  (7.10) 

In  each  of  the  above  areas,  many  issues  arise  in  evaluat¬ 
ing  efficient  mechanisms  for  increasing  and  decreasing. 

7.1  Image  Resolution 

7.1.1  Arc  simple  fractions  a  good  guide  for  image  ies> 
lution  scaling  and  extending?  If  so.  what  should  be  the 
numerical  basis? 

The  following  issues  suppon  the  use  of  simple  fractions 
when  scaling  resolution: 

a)  The  use  of  raster  aligned  overby  information  (both 
teal  and  vinual  overbys).  as  is  typbal  on  computer 
screens. 

b)  The  use  of  scalabb  compression  techniques  which 
scale  in  octaves. 

c)  The  use  of  picture  detail  near  (v  above  the  Nyquist 
bandwidth  limit,  fur  those  applications  where  appro¬ 
priate  (typical  in  computer  generated  imagery). 
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(t)  The  ability  to  use  simpler  and  lower-cost  transcod¬ 
ing  in  some  situations,  with  less  degradation  (due  to 
phase  alignment). 

e)The  potential  ability  to  get  slightly  improved  trans¬ 
coding  (due  to  phase  alignment)  in  high  quality, 
higher  cost  transcoders. 

7.1.2  Given  that  CCDs,  active  matrix  liquid  crystal 
displays  and  projectors,  and  other  devices  have  inher¬ 
ently  lithographed,  and  therefore  very  fued.  resolutions, 
should  the  numerical  basis  and  specific  (and  optimally 
related)  image  sizes  be  definitively  determined? 

7.1.3  Further,  in  light  of  the  many  such  fixed  size, 
resolution,  and  raster  format  devices,  would  not  square 
pixels  be  an  important  consideration?  How  many  indus¬ 
tries  and  applications  require  square  pixels,  and  would  be 
bun  if  a  non-square  pixel  format  were  chosen  for  ad¬ 
vanced  television?  How  severe  would  the  degradations 
in  quality  and  conversion  cost  be  in  such  a  case?  Con¬ 
versely.  how  many  industries  and  applications  not  requir¬ 
ing  square  pixels  would  be  hurt  if  a  square  pixel 
environment  were  imposed  upon  them? 

7.1.4  If  simple  fraction  resolution  transformation 
guidelines  are  deemed  wonhwhile.  should  there  be  a 
numerical  basis  of  cenain  base  resolutions?  Is  the  hori¬ 
zontal  axis  more  critical  for  the  basis  due  to  digital  de¬ 
sign  considerations?  If  so.  arc  powers  of  two  the  opiim.nl 
basis  for  horizontal  resolutions,  such  as  5 12. 1024. 20^8. 
etc.? 

7.1.5  When  transcoding  resolution,  what  parameters 
are  required  to  perform  optimal  transcoding?  Is  the 
bandlimiting  associated  with  transcoding  at  ratios  other 
than  simple  fractions  an  acccpt.nbie  degradation?  In  what 
industrics/appiications  is  it  acceptable,  and  not  accept¬ 
able? 

7.1.6  CCDs,  active  matrix  liquid  cr>'stal  displays  and 
projectors,  and  computer  generated  images,  and  other 
image  scanning,  generating,  and  displaying  devices  can 
produce  digital  image  values  which  arc  not  bandwidth 
limited.  Funher.*  it  is  common  for  computer  dis*piays  to 
use  text,  w’indows.  and  graphics  which  arc  aligned  to  the 
raster  and  which  use  maximum  bandwidth  signals  such 
as  white  lines  of  pixels  on  bixk  and  bixk  dots  on  white, 
etc.  Given  that  such  non-bnnd-limitcd  signals  are  com¬ 
mon  and  useful  in  many  industries,  is  the  issx  of  requir¬ 
ing  band  limiting  for  transcoding,  compression,  or 
coding  problematic?  What  industries  would  be  signifi¬ 
cantly  hindered  if  high  dellniiion  sy.stcms  raquired  hand 
limiting? 

7.1.7  What  useful  increments  of  scaling  might  he  best  for 
a  resolution  hierarchy?  ^tors  of  two.  being  one  optiirailly 
dccodabie  resolution  per  octivt!  Two  stunples  per  resolu¬ 
tion  KUve  such  as  3/4  and  1/2.  or  .3/2  and  2?  Or  is  continu¬ 


ously  variable  resolution,  and  associated  band  limiting,  a 
requirement  in  some  industrics/appiications? 

7.1.8  Should  an  image  architxiure  emphasize  the  abil¬ 
ity  to  apply  more  resolution  to  some  screen  areas  than 
o^ers?  Or  should  constant  image  resolution  and  quality 
be  mandated  for  all  areas  of  the  image?  Is  the  answer 
different  for  different  industries/applications?  What 
problems  might  arise  if  such  a  signal  format  were  consid¬ 
ered  for  production?  What  issues  arise  within  production 
switchers  for  such  formats? 

7.1.9  If  some  image  areas  are  updated  with  different 
resolutions,  or  temporal  rates,  than  others,  should  the 
universal  header  or  descriptor  contain  this  information 
and  make  it  visible  to  all  devices,  or  is  it  xcepuble  if 
such  information  is  hidden  within  the  dao  stream? 

7.1.10  Are  image  region  rectangular  and  square  struc¬ 
tures.  such  as  that  proposed  as  tiles,  a  useful  construct  in 
providing  imeroperability  and  flexibility  in  image  up¬ 
date? 

7.1.1 1  How  likely  are  future  image  structures  which  are 
not  xy  raster  based  such  as  hexagonal  or  poisson  distribu¬ 
tion  samplings?  Is  it  possible  to  develop  an  image  archi¬ 
tecture  which  has  mechanisms  to  xcommodate  such 
structures  in  the  future?  Can  we  anticipate  the  transcod¬ 
ing  steps  between  a  square-pixel  xy  raster  and  a  uniform 
hexagonal  or  poisson  distribution  raster  and  thereby  do 
our  best  to  allow  for  such  future  possibilities? 

7.1.12  How  completely  should  image  filtering  and  pro¬ 
cessing  histories  be  specified  in  order  to  support  subse¬ 
quent  image  processing  operations?  A  knowledge  of  the 
concatenation  of  all  pre-filters  may  be  desirable  in  com¬ 
plex  image  operations.  How  lengthy  are  such  histories 
likely  to  become? 


7. 1 . 13  How  do  flying-spot  devices  such  as  CRTs  and 
cathode-ray  cameras  relate  to  fixed  raster  "lithographed'* 
devices  sxh  as  CCDs  and  xtive  mau’ix  liquid  crysul 
displays  and  projectors?  How  can  the  high  definition 
image  architecture  xcommodnic  both  types  of  image 
sources  and  displays  without  substantial  quality  loss? 
Can  both  types  of  image  data  be  processed  in  the  same 
transforming  devices  using  the  same  pnrtimeters.  or  arc 
different  processing  steps  required  for  the  two  different 
types  of  image  data? 

7. 1 . 14  Is  there  a  representation  appropriate  in  which  an 
idealized  pixel  can  be  generated  tltrough  signal  process¬ 
ing?  The  purpose  of  such  an  idealized  pixel  would  be  to 
be  used  as  input  to  resolution  scaling  (also  known  as 
resolution  uunscoding).  Would  the  digital  signal  process¬ 
ing  required  to  create  such  an  idealized  pixel  result  in 
unxccptable  anifxts  due  to  the  footprint  of  the  convolu- 
tkNial  pmces-siiig  kcnicl? 
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7.1.15  In  tnditionoJ  television,  there  was  no  possibility 
of  color  dot  triad  alignment  with  scanlines  or  pixels. 
However,  with  the  advent  of  lithographed  displays,  such 
as  active  matrix  flat  panel  dispbys.  the  rebiionship  be* 
tween  the  color  tnad  and  the  pixel  becomes  exact.  Is 
there  an  optimal  organization  of  color  area  portions  in  the 
context  of  a  digital  image  architecture?  Should  the  colors 
be  overbyed  onto  a  common  area  through  the  use  of 
knslets.  fibers,  or  other  techniques?  Should  the  pixels  be 
adjusted  so  that  they  even  overbp  through  such  optical 
techniques  in  order  to  reduce  blocky  appearance? 

7.1.16  If  the  color  regions  representing  a  pixel  must 
remain  spatially  distinct,  is  there- a  particular  arrange* 
ment  which  is  optimal?  If  so.  should  the  precise  positions 
of  the  color  sub-pixel  areas  be  taken  into  account  in  the 
digital  image  architecture  and  in  the  representation,  cap¬ 
ture.  and  processing  of  the  digital  image  signal?  What 
effect  do  the  gaps  between  color  regions  have?  How  do 
lenticular  or  Fresnel  screens  affect  color? 

7.1.17  If  the  color  regions  representing  a  pixel  must 
remain  spatially  distinct,  could  more  than  one  triad  be 
pbced  within  one  logical  pixel?  Is  there  an  optimal  num¬ 
ber  of  such  sub-uiads.  such  as  perhaps  four?  Is  there  an 
optimal  arrangement  of  such  su^iriads?  If  so.  should  this 
arrangement  be  taken  into  account  in  the  digital  image 
architecture  and  in  the  representation,  capture,  and  pro¬ 
cessing  of  the  digiul  image  signal? 

7. 1.18  If  a  color  space  were  to  support  more  than  three 
primary  colors,  would  there  be  benefit  to  using  four  or 
more  primaries  in  some  approprbte  configuration  as  a 
standvdized  pixel  shape? 

7.1.19  On  a  CRT.  the  spot  shape  is  usually  a  round  or 
ellipsoidal  gaussbn  which  flies  horizontally.  On  a  fbt 
panel  dispby.  the  spot  is  usually  a  square  shape  which  is 
stationao'.  Tliere  may  also  be  dead-zones  between  pixels. 
What  signal  properties  should  be  adjusted  to  take  these 
issues  into  account? 

7.1.20  Could  some  of  these  issues  be  handled  by  the 
use  of  an  idealized  or  standardized  pixel  represenution. 
with  defined  transformations  at  the  receiving  device  ap¬ 
proprbte  to  its  particular  pixel  configuration?  If  so.  how 
should  the  digital  image  architecture  specify  this? 

7. 1.21  What  should  he  done  concerning  similar  issues 
in  CCD  image  sensors? 

7. 1 .22  What  is  the  imptici  of  these  %’arious  issues  on  the 
Kell  factor? 

7.1.23  Would  other  pixel  shapes  such  as  triangle,  hex¬ 
agonal.  and  diamond  have  advantages  for  future  image 
capture  and  dispby  technologies? 


7. 1.24  The  1.333:1  (4  x  3)  aspect  ratio  is  widely  used  in 
television  and  computers.  In  the  motion  picture  industry, 
1J7:1. 1.66:1. 1.85:1.  and  235:1  are  all  commonly  used. 
The  8.5*  X  ir  page  has  an  aspect  ratio  of  about  0.77;1. 
The  European  page  size  t^proaches  0.71:1.  Computer 
display  memories  are  most  simply  wganized  with  aspect 
ratios  which  are  simple  fractions  such  as  1:1.  3:2.  and 
2:1.  Medical  radiology,  still  photography,  newspapers, 
magazines,  books,  and  other  images  have  a  number  of 
commonly  used  aspect  ratios.  How  do  we  achieve  siqK 
port  of  all  of  the  aspect  ratios  in  common  use? 

7. 1 .25  Can  the  header/descriptor  be  used  to  indicate  the 
aspect  ratio  and  resolution  of  an  image,  so  that  the  dis- 
pbying  device  can  do  its  own  version  of  letter  boxing 
(unused  areas)  or  overscan  (discarded  areas)?  For  those 
systems  which  have  compressed  frame  groupings,  how 
could  the  edges  of  the  letterbox  be  protected  from  the 
moving  blocks? 

7. 1.26  What  would  be  the  most  widely  used  mappings 
between  common  aspect  ratios  and  anticipated  common 
screen  sizes? 

7.1.27  If  the  16:9  aspect  ratio  becomes  popular,  what 
mappings  are  likely  for  European  (metric)  and  American 
(English)  sized  paper  pages,  wide  screen  movies  at 
2.35:1.  television  and  film  at  1333:1  (4  x  3),  movies  at 
1.85:1,  and  other  widely  used  image  formats? 

7.1.28  Can  the  digital  image  architecture  siqiport  not 
only  a  wide  variety  of  aspect  ratios  m  the  material  being 
displayed,  but  also  a  wide  variety  of  aspect  ratios  at  the 
receiving  dispby  itself? 

7. 1 .29  If  the  digital  image  architecture  supports  multi¬ 
ple  aspect  ratios,  with  interoperability  between  displays 
at  such  various  aspect  ratios,  what  are  the  key  technic^ 
issues?  Should  the  horizontal  resolution  of  all  aspect 
ratios  be  held  at  simple  fractional  rebtionships.  while 
allowing  the  vertical  resolution  (with  square  pixels)  K> 
vary  in  fine  increments  to  fit  the  exact  aspect  ratio  de¬ 
sired  at  the  dispby? 

7.1.30  The  dbgonals  of  the  camera  apertures  of  com¬ 
mon  film  formats  have  dimensions  as  follows: 


Format 

Diagonal 

3Smm  Full  Aperture 

31.14  mm 

35mm  Academy 

27.16  mm 

3Smm  Still 

4337  mm 

65mm 

5730  mm 

Professional  Roll  Still 

101.1  mm 

There  are  many  high  quality  lenses  in  existence  for  thera 
formats.  Would  it  be  useful  to  keep  these  dimensions  in 
mind  when  developing  lithographed  sensors  such  as 
CCD  arrays? 
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7  J  Image  Temporal  Rate 

7.2. 1  Should  a  single  temporal  rate  be  emphasized?  Or 
is  there  a  need  to  support  multiple  temporal  rates  for 
difTerent  industhes/applications,  or  within  one  applica* 
tion? 

7.2.2  Are  there  sufllciem  mechanisms  avaibble  in  tem* 
poraJ  propenies  of  high  definition  systems  to  handle  the 
issue  of  computer  GIT  displays  requiring  refresh  rates 
higher  than  70  Hz? 

7.2.3  Is  there  a  mechanism  for  reliably  and  consistently 
transforming  high  deHnition  television  imagery  to  24  fps 
film  for  theatrical  release? 

7.2.4  Should  there  be  a  family  of  temporal  rates  which 
are  lebted  by  a  simple  fraction  rule?  If  so.  what  should 
be  the  numerical  basis  of  such  rates? 

7.2J  When  temporally  transcoding,  what  temporal 
beat  frequencies  are  visually  acceptable?  Does  the  12  Hz 
beat  frequency  of  the  3-2  pull  down,  and  its  wide  use  and 
seeming  acceptance,  indicate  that  12  Hz  or  higher  is  an 
acceptable  beat  frequency,  or  are  there  frame  patterns  in 
which  higher  beat  frequencies  are  required  for  acceptable 
viewing? 

7.2.6  What  sort  of  synchronization  mechanisms  are 
optimal  for  digital  systems,  given  that  inherent  digital 
system  flexibility  need  not  require  evety  device  to  be 
locked  to  a  common  master  very-high-frequency  oscilla¬ 
tor  (near  100  MHz)? 

72.7  CCDs,  active  matrix  liquid  crystal  dispbys  and 
projectors,  and  other  devices,  utilize  a  portion  of  hori¬ 
zontal  or  vertical  retrace  intervals  to  tran.sfer  to/from 
frame  buffers.  Future  systems  may  not  require  these  in¬ 
tervals.  How  will  this  iiTect  the  need  to  dedicate  signal 
time  to  these  intervals? 

72.8  Given  that  CCDs,  active  matrix  liquid  crystal  dis¬ 
pbys  and  projectors,  and  other  devices,  have  no  inherent 
flicker  or  update  rate  requirements,  should  temporal  rate 
flexibility  be  pan  of  the  high  definition  architecture? 

7.2.9  Should  an  image  architecture  emphasize  the  abil¬ 
ity  to  use  a  higher  update  rate  for  some  screen  areas  than 
others?  Or  should  consunt  irmige  rate  be  mandated  for 
all  areas  of  the  image?  Is  the  answer  different  for  differ¬ 
ent  industries/applications? 

7.2.10  It  is  common  to  use  a  50%  temporal  duty  sam¬ 
pling  cycte  (180  degree  shutter  in  film  cameras  to  allow 
film  pull-down),  which  provides  a  balance  between  mo¬ 
tion  blur  and  slurpness.  b  not  this  temporal  undcrsampl- 
ing  cenain  to  introduce  temporal  aliasing  during 
temporal  rate  transcoding?  b  not  such  albsing  cenain  to 
appear  as  anifacts  which  occur  at  the  temporal  beat  fre¬ 


quency  rate  (e.g..  a  50  Hz  to  60  Hz  transcoding  would 
have  a  10  Hz  beat  frequency)? 

7.2.11  Some  CCD  sensors  used  in  cameras  see  the 
entire  frame  area  during  the  exposure  time.  This  b  sim¬ 
ilar  to  nim  exposure.  Some  tube  cameras  scan  the  image 
top  to  bottom,  whether  progressively  or  imerbeed.  What 
are  the  temporal  processing,  dbpbying,  and  viewing  ef¬ 
fects  caused  by  mixing  devices  which  integrate  the  entire 
image  versus  those  that  scan  the  image  from  top  to  bot¬ 
tom?  What  temporal  issues  arise  due  to  the  fact  that  the 
top  of  the  image  may  be  seen  or  displayed  nearly  a  frame 
time  before  the  bottom  of  the  image,  and  half  a  frame 
time  before  the  center  of  the  image? 

7.2.12  As  just  mentioned,  both  dispbys  and  sensors 
exist  which  scan  from  t^  to  bottom  or  which  integrate 
the  entire  image  for  the  frame  time.  What  architectural 
issues  should  be  examined  in  attempting  to  take  thb  issue 
into  account?  What  issues  are  involved  in  converting  a 
scanned  image  for  area  dispby  or  in  convening  an  area 
sensed  image  for  scanned  dispby?  How  do  these  issues 
affect  film  scanning  such  as  in  a  telecine?  Docs  the  wipe 
time  involved  in  the  physical  film  camera  shutter  have  an 
affect?  How  do  these  issues  affect  film  recording  from  an 
electronically  captured  moving  image? 

7.2.13  How  do  these  scanning  pattern  issues  affect 
temporal  transcoding,  finding  motion  vectors  for  com¬ 
pression  or  standards  conversion,  effects  processing,  or 
other  image  processing  operations?  What  issues  arise 
when  compositing  or  mixing  multiple  image  sources 
captured  with  different  scanning  patterns? 

7.2.14  Standards  converters  which  convert  between 
525/60  and  625/50  are  available  at  various  levcb  of 
cost/pcrformance,  utilizing  a  number  of  techniques.  At 
the  highest  levels  of  performance,  motion  estimation 
may  be  employed  to  interpolate  frames.  Undersampling 
due  to  interlace  may  have  an  impact  of  thb  process.  What 
problems  may  thus  arise  for  architectures  that  rely  on 
temporal  transcoding  or  standards  conversion? 

7.2.15  When  displaying  multiple  windows  of  moving 
images  on  a  screen,  as  in  a  future  video  teleconference, 
how  can  buffering  be  minimized  for  each  picture  stream 
in  order  to  achieve  dispby  synchronization?  What  op¬ 
tions  are  available  for  local,  regional,  and  global  syn¬ 
chronization.  both  loose  and  near  exact? 

7 .2. 1 6  Is  there  a  benefit  to  selecting  a  particular  master 
oscillation  rate,  from  which  pixel  clocks  in  the  scalable 
system  are  derived?  If  so.  what  candidate  rates  might 
offer  advantages? 

7.2.17  Some  applications,  such  as  teleconferencing, 
interactive  flight  simubtion.  or  virtual  reality,  require 
k)w  latency.  Other  applications,  such  as  broadcast  televi¬ 
sion.  can  have  suhsuniial  bicney  without  much  problem. 
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Wha«  digital  image  architecture  mechanisms  are  needed 
10  provide  for  those  applications  which  require  low  la* 
lency? 

7.2.18  Compression  algorithm  design  is  significantly 
affected  by  a  low  latency  requirement.  What  latency  is 
implied  by  any  candidate  digital  advanced  television  sys* 
lem?  What  affect  would  such  inherent  latency  have  on 
usefulness  for  those  applications  requiring  low  btency? 

7.2.19  Digital  networic  design  is  affected  by  needs  for 
real-time  bandwidth  as  well  as  latency  requirements. 
How  does  the  need  for  low  latency  combined  with  high 
real-time  bandwidth  in  these  industries  affect  digital  in- 
teracu've  network  design? 

7J  Image  Layers,  Overlays,  and  Windows 

7.3.1  In  some  workstations,  one  to  four  bits  (usually 
two  or  four)  are  used  for  overlay  planes,  which  are  added 
on  top  of  the  underlying  image  (which  is  usually  full 
color).  However,  many  popular  systems,  like  the  Macin¬ 
tosh  Us.  the  DECstation  50009.  and  others,  do  not  use 

,  Overby  pbnes.  It  is  possible  for  X- Windows®  and  other 
window  management  systems  to  suppon  overby  pbnes 
as  well  as  managing  full  color  windows  in  the  main  bit 
planes?  Should  overlay  planes  be  pan  of  digital  ad¬ 
vanced  television  architectures? 

7.3.2  If  overby  pbnes  are  used,  should  these  overlay 
pbnes  be  implemented  in  hardware,  or  as  a  virtual  mech¬ 
anism  in  software,  or  can  both  be  accommodated? 

7.3.3  How  many  bits  of  real  or  vinual  overlay  plane 
should  be  mandated  or  recommended,  if  overlay  pbnes 
are  mandated  or  recommended? 

7.3.4  The  common  practice  of  bandwidth  limiting  mov¬ 
ing  images  suggests  the  possibility  of  using  overlay 
planes  to  contain  non-band-limited  imagery,  with  the 
band-limited  moving  images  using  the  main  bit  pbnes 
underneath.  Overlay  pbnes  could  easily  contain  the  usu¬ 
ally  non-band-limited  window  borders,  text,  stipple  pat¬ 
terns.  graphics,  etc.,  which  characterize  computer 
screens.  This  architecture  would  require  all  receiving 
devices  to  support  either  real  or  virtual  overby  planes.  Is 
such  an  architecture  appropriate? 

7.3.5  Is  it  necessary  for  interoperability  across  indus¬ 
tries  and  applications  to  allow  for  the  possibility  of  non- 
band-limited  picture  data  co-existing  with  band-limited 
imagery  from  cameras  or  other  (possibly  synthetic) 
sources? 

7.3.6  Do  appropriate  digital  image  compression  algo¬ 
rithms  exist  which  can  pass  non-band-limiicd  picture 
data  such  as  that  used  typically  on  computer  screens?  If 
such  a  compression  technique  exists,  would  this  allow 
such  non-band-limited  picture  data  to  be  transmitted  to¬ 


gether  with  the  moving  picture  stream?  What  are  the 
properties  of  such  a  compression  algorithm,  if  one  ex¬ 
ists? 

7.3.7  Can  the  dam  areas  avaibble  in  some  digital  ad¬ 
vanced  television  proposals  be  used  to  convey 

data  for  use  with  real  or  vinual  overby  planes?  Would 
Unix  X- Windows®.  Dispby  Postscript®.  Apple  Macin¬ 
tosh  Toolbox®.  Microsoft  Windows®,  lax.  or  other  forms 
of  encoded  graphic  and  text  data  such  as  run-length 
codes  be  conveyable  in  this  manner?  Are  there  one  or 
more  such  techniques  which  might  be  appropriate  lo 
suppon  for  digital  advanced  television? 

7.3.8  h  the  proprietary  nature  of  many  of  these  ibrmats 
a  banter  to  inten^erabUity.  or  are  there  potential  lolw- 
tions  to  provide  universal  access? 

7.3.9  Are  open  standards  such  as  ICES  for  vector  and 
graphics  images.  COM  for  raster  images,  or  Open  Docu¬ 
ment  Architecture  (ODA)  for  compound  documents 
worth  considering  in  light  of  desire  for  universal  access? 

7.3.10  Should  one  or  a  small  number  of  such  formats 
be  supported  universally?  By  standardizing  on  only  one 
such  format,  all  receiving  devices  would  only  need  to 
suppon  that  single  format.  If  no  such  single  standard  is 
chosen,  then  each  receiving  device  desiring  to  display 
computer-type  window  or  graphics  displays  might  need 
to  suppon  many  or  all  of  the  formats  in  common  use.  Is 
there  a  way  to  encourage  adoption  of  one  or  a  small 
number  of  text  and  graphics  protocol  standards  to  be 
universally  supponed? 

7.3.11  Should  a  digital  image  architecture  require  that 
text  and  graphic  data,  typical  of  computer  dispbys,  be 
able  to  be  passed  to  the  dispby  by  either  the  use  of  real  or 
vOTual  overby  planes  or  appropriate  compression  algo¬ 
rithms  capable  of  passing  this  infomtation?  Should  the 
digital  image  architecture  insist  on  at  least  one  of  these 
two  ways  of  passing  computer  dispby  irtformation? 

7J.12  As  an  alternative  to  screen-tesolution-speciflc 
paphics.  should  all  graphics  be  specified  with  much 
higher  precision  than  the  display?  Such  might  likely  use 
outline  fonts  and  graphic  commands  which  can  do  prtv 
ponional  blending  of  text  with  appropriate  filters,  and 
which  allow  image  derail  to  pbced  between  pixels  or 
lit^  Would  such  rxm-raster-aligned  text  and  graphics, 
with  ^ropriate  filtering,  be  acceptably  legible  and  clear 
compared  to  rwer-^'gned  and  non-band-limited  text 
and  graphics  as  is  typical  of  current  computer  screens? 

7.3.13  If  presctuaiion  of  non-band-limiied  image  infor¬ 
mation.  as  is  typical  of  computer  tfispbys.  is  a  require¬ 
ment.  then  should  multiple  screen  resolutions  be 
supported  for  resolution  scalability?  If  so.  should  the 
simple  fraction  guideline  be  used  for  the  tebtionships  of 
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screen  resoluuons  due  to  ihc  need  lo  preserve  legibility 
and  clarity  of  the  non-bandwidih-limiied  image  data? 

7.3.14  Should  the  capability  for  selecting  among  a 
variety  of  overlay-planes  for  display  be  pan  of  digiul 
advanced  television  architectures?  Could  such  a  selec¬ 
tion  be  useful  for  closed  captions,  sign-language  inserts, 
foreign  language  subtides.  television  program  guides, 
sports  SDtistics.  or  other  picture  augmentation  informa¬ 
tion? 

7.3.15  How  many  such  simultaneous  overby  pbnes 
could  or  should  be  supported? 


7.3.16  Should  more  general  compositing  functions 
other  than  overby  be  supported  at  the  receiving  device  as 
a  pan  of  digital  advanced  television  architectures?  In 
particular,  should  alpha  blending  be  supported?  Alpha 
blending  uses  proportional  blending  at  edges  of  the  over- 
bying  area.  TTtis  technique  is  also  known  as  proportional 
matte  edge  compositing. 

7.3.17  If  such  compositing  should  be  supported,  should 
there  be  limits  or  scene-specification  guidelines  for  the 
amount  of  area  involved  in  mattes  and  proportional 
edges  each  frame,  or  exh  second? 


7.3.18  If  such  compositing  should  be  supported,  how 
many  layers  of  composite  should  be  allowed.  If  the 
aaivation  of  the  composited  foreground  overlays  ts  su^ 
potted  it  is  likely  to  be  controlled  and  specified  by  the 
u^r  at  the  receiving  device.  If  foreground  overbys  are 
transmitted  in  moderate  time  intervals  such  x  a  ^ond 
or  more,  then  the  number  of  overlays  allowed  will  affect 
the  amount  of  buffering  required  at  the  receiving  device. 
Should  such  issues  be  a  part  of  an  image  architecture  for 
digital  advanced  television? 

7J.19  Is  the  concept  of  tiles  and  plates  useful  by  provid- 
ing  compact  data  representation  for  locating  pixels  used 
in  proportional  alpha  blending?  Tiles  and  plates  are 
screen  area  subdivisions  allowing  specification  of  roreen 
locations.  The  concept  of  pbtes  is  similar  to  city 
and  the  concept  of  dies  is  simitar  to  ^ 

pixels  being  similar  to  locations  on  a  gnd  Itud  over  the 
house  lot.  This  organization  makes  addresing  a  given 
pixel,  tile,  or  plate  location  simpler.  It  also  allows  ^I 
focations  on  a  given  street  to  be  easily 
proximity  of  each  house  to  the  next.  Is  this  ^ 

area  subdivision  useful?  Arc  there  other  methods  for 
subdividing  an  image  which  are  appropriate  or  useful  in 
digital  advanced  image  architectures? 

7.3.20  Should  the  use  of  windows  on  the  dispby  of  the 
TKciving  device  be  aniiciptiicd  as  part  of  the  «hiicc- 
ture?  What  potcntbl  affect  would  there  be  on  the  arehi- 
tecture  by  anticipating  the  iwc  of  two  or  more 
within  the  display-  each  independently  coniroll^  ^ 
each  user  at  each  display  ?  Is  the  minimal  funciKiiuiliiy  of 


picturc-in-picture  approprbte.  or  is  the  elaborate  window 
sizing,  positioning,  and  overlaying  capability  of  typical 
compute  window  systems  appropriate,  or  somewhere  m 

between? 

7  3  21  If  windows  are  anticipated,  how  would  dyramic 

sizing  take  into  xcount  the  possible  need  for  stm^c- 
fractional  guidelines  in  resolution  scaling?  Should  itere 
be  notches*in  the  window  sizes  at  simple  fracuon^inis 
to  allow  clearer  and  more  legible  text  and  graphics. 

7  3  22  If  the  local  computer  wishes  to  use  the  overby 
planes,  how  would  such  use  interact  with  ^ote  control 
SrSiese  overby  planes?  Would  the  local  ^srem  have 
priority?  How  would  such  pnority  be  controlled. 

7  3  23  Is  it  not  likely  that  locally  Mutated  synthetic 
COTputer-generaied  images  wiU  often  be  ov«^^  onto 
the  digital  advanced  television  image  stream.  What  filter 
representations  are  appropriate  for  Mtcbng  the  pixel 
representations  of  the  simubted  and  re^  ot  le 
irogery?  Should  idealized  pixels  be  used.  What  should 
be  the  assumptions  concerning  fiying-spot  versus  Itthcy 
graphed  raster  pixel  representations  in  this  context? 
What  are  the  optimal  computer  anti-aliasing  filters  for 
such  composited  images? 

7.4  Compression  Quality  Level 


7  4  1  What  is  the  proper  tradeoff  between  compression 
quality,  data  rate,  image  resolution,  image  dynamic 
range,  image  color  fidelity,  and  image  temporal  rata? 
How  should  these  tradeoffs  be  resolved  for  diffcrennn- 
dustries  and  applications  (e.g..  medicine,  scicmific  visu¬ 
alization.  production,  videoconferencing,  transmission). 


7.4.2  Can  a  family  of  compression  quality  levels  be 
developed  which  allow  spaibl  and  temporal  resolution 
scalability  with  a  byered  coding  technique?  Cw  such  a 
compression  technique  compete  successfully  with  single 
point  solution  or  single  resoluiion/raie  heavily  opumtzed 
systems?  Would  such  a  technique  offer  benefits  to  tnter- 
opcrabiliiy  between  industries/applicaUons?  Are  the  ben¬ 
efits  of  scabbility  and  extensibility  sufficient  to  jusufy 
the  effort  to  develop  a  byered  compression  system  of 
sufficient  quality?  How  much  is  gained  by  compromis¬ 
ing  the  orthogonality  required  for  scabble  byered  sys¬ 
tems  via  the  use  of  non-lincar  terms?  Are  such  non-linear 
data  interactions  likely  to  hinder  other  interoperability 
needs,  in  addition  to  the  desire  for  scalability  and  exten¬ 
sibility? 


7.4.3  How  serious  is  Ihc  problem  of  motion  vector 
interaction  between  motion  vectors  used  in  standards 
conversion  and  motion  vectors  used  in  motion-compen¬ 
sated  compression?  Would  an  image  which  has  been 
transcoded  in  either  resolution,  temporal  rate,  or  both, 
interact  acceptably  with  a  subsequent  digital  compres¬ 
sion  algorithm,  or  would  the  compression  motion  vectors 
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show  severe  errors  and  aliasing  beat  frequencies?  If  the 
image  is  further  transcoded  after  subsequent  dccompres* 
Sion,  would  the  errors  compound  further? 

7.4.4  What  sub>pixel  resolution  is  required  for  motion 
vectors  used  in  mouon*compensaied  compression? 

7.4.5  What  is  the  effect  of  concatenated  compress/de* 
compress  cycles  within  one  algorithm  as  used  when  ex* 
changing  images  multiple  times  between  different 
industries/applications?  What  is  the  effect  of  concate* 
Dated  compress/decompress  cycles  between  different 
compression  algorithms?  Are  the  preliminary  studies 
which  indicate  that  this  might  result  in  severe  degtada* 
tion  correct? 

7.4.5  It  is  common  practice  in  some  compression 
schemes,  such  as  MPEG,  to  use  frame  groupings.  How 
can  live  switching  be  performed  if  frame  groupings  are 
not  aligned?  How  can  misalignment  of  key  intermediate 
anchor  frames  be  prevented  when  performing  multiple 
compress  and  decompress  cycles? 

7.4.7  Do  these  issues  argue  for  commonality  and  com* 
patibility  of  compression  algorithms,  and  a  minimization 
or  elimination  of  temporal  and  spatial  transcoding  in 
processing  images?  If  transcoding  is  applied,  are  simple 
fraction*based  temporal  and  spatial  transcodings  less 
prone  to  degradation  than  arbitrary  fraction  trans* 
codings? 

7.4.8  Can  a  high  resolution  system  architecture  accom* 
modate  the  rapid  algorithmic  and  digital  hardware  ad* 
vances  in  the  state  of  the  art  which  appear  to  be 
irteviuble  exh  year?  If  an  optimal  algorithm  for  this  year 
is  selected,  what  is  the  likelihood  that  this  algorithm 
would  be  obsolete  in  five  or  ten  years  due  to  im* 
provements  in  hardware  or  algorithmic  techniques?  If 
compression  algorithms  are  likely  to  become  obsolete 
every  five  years,  what  high  definition  system  architcc* 
lure  principles  can  be  developed  to  allow  radical  algo* 
rithmic  and  hardware  improvements  as  appropriate?  Is 
the  hcader/dcscriptor  sufficient,  or  are  other  principles 
required  in  conjunction  with  the  basic  compression  algo¬ 
rithm  design  to  allow  extensibility  or  easy  replace* 
ment/upgrade?  Can  new  algorithms  coc.\ist  with  old 
algorithms  while  providing  efficient  bandwidth/spcc* 
trum  usage,  in  light  of  the  fact  that  new  algorithms  nuty 
be  many  times  mote  efficient  than  old  ones?  Can  old 
algorithms  be  continued  in  use  when  their  inefficiency 
approxhes  fxtors  of  four  or  eight  below  optimal  with 
respect  to  the  newest  algorithms  and  hardware?  How  can 
extensibility  be  accommodated,  as  will  certainly  be  re* 
quired.  while  maintaining  service  to  older  devices  which 
require  inefficient  digital  signal  architectures  fora  given 
point  in  time? 

7.4.9  How  likely  will  the  current  DCT  and  sub*band 
systems  advance  x  future  optimal  algorithms  in  five  to 


ten  yexs?  Are  other  algorithms  such  x  frxtal.  wavdet. 
vxtor  quantization  or  other  large  codebook  algotiihns 
likely  to  be  more  efficient  in  some  future  hardware  caps* 
bility  level?  Is  it  likely  that  future  compression  algo* 
rithms  may  be  x  yet  unanticipated?  What  steps  can  be 
taken  to  prepare  for  such  major  shifts  in  compression 
ixhniques.  should  they  xcur? 

7.4. 10  Is  it  likely  that  decompression  chips  in  ttcewieg 
devicx  could  be  programmable?  If  so.  could  updacs  0 
comprexion  algorithms  be  downloaded  xing  headeefide* 
scriptor  support,  or  by  other  software  disuibution  aeil^ 
ods?  Would  such  updates  be  useful?  Would  it  be  uaefiiltt 
place  the  decompression  module  on  a  standardized  eaed. 
so  that  the  chip  itself  could  be  repbeed  x  technoloBr 
advancx? 

7.4.11  Are  there  proposed  signal  formats  based  npon 
very  rapid  panial  frames  which  can  support  muttplh 
receiver  dispby  ratx  from  a  single  signal  without  de^ 
dation  of  any  rate?  Do  such  formats  also  provide  to 
minimum  buffering  of  multiple  asynchronous  sotnoci 
being  presented  on  the  same  display? 

7.4.12  Some  applications,  such  x  the  colorization  of 
movies,  create  dau  which  defines  the  objects  and  toar 
boundarix  and  motion  for  every  frame.  Cx  such  daabe 
useful  in  a  system  architecture  for  compression  oroAer 

USX? 

7.4. 1 3  Is  it  useful  to  gather  macro  information  abtmt  a 
xene  by  encoding  data  such  x  camera  position, moliia. 
and  orientation?  Could  tripod  hud  encoden  be  useful 
for  this  purpose?  Are  there  navigational  tools  wbtdi 
could  be  adapted  for  this  purpose?  Would  such  gtabd 
information  of  sufficient  t^ue  in  compression  aul 
other  uses  to  warrant  the  upturing  of  this  information? 

7  J  Data  Rate  in  Relationship  to  Image  Quafi^ 

7.5. 1  Can  digital  image  compression  algorithms  be  tap¬ 
ered  in  resolution  and  temporal  rate,  while  at  the  sane 
lime  being  usefully  byered  in  dau  rate? 

7.5.2  Is  a  dau  rate  hierarchy  possible  in  this  context? 

7.5.3  There  will  always  be  a  variety  of  bus  rates,  mem* 
ory  bandwidths.  disk  transfer  rates,  and  channel  rates. 
What  are  the  useful  raix  of  a  dau  rate  hierarchy,  if  such 
a  hbrarchy  is  poxibic? 

7.5.4  Is  orthogonality  of  temporal  and  spatial  lesotu- 
tion  vb  a  byered  hierarchical  comprexion  technique 
poxibic?  Is  data  rate  onhogonaiity.  integrated  with  tem¬ 
poral  and  spaibl  resolution,  poxibic? 

7.5.5  Can  other  useful  augmenuiions  be  byered  onto 
the  dau  rate  sxh  x  extra  camera  views,  stereoscopic 
imagery.  :*valuc  depth  information,  blending  exffi* 
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cienu  for  compositing,  additional  dynamic  range  or  im> 
proved  colorimetry? 

7.5.6  Can  high  quality  still  frames  be  acceptably  inter¬ 
leaved  into  the  data  stream  concurrent  with  the  moving 
image  dau  stream? 

7.5.7  Can  alternate  aspect  ratios  be  provided  simulta¬ 
neously  by  an  appropriately  layered  data  stream  con¬ 
struction? 

7  J.8  Can  extra  image  channels  such  as  closed-caption 
sign  bnguage  display  windows,  previews  of  future 
shows,  and  others,  be  acceptably  byered  into  the  data 
stream? 

7.5.9  Can  three  dimensional  image  construction  infor¬ 
mation  be  provided  for  those  receiving  devices  capable 
of  creating  three  dimensional  computer  generated  im¬ 
ages?  Could  such  images,  by  computer  synthesis  at  the 
display,  substitute  for  tiny  details  which  are  not  ade¬ 
quately  captured  with  the  camera  resolution  such  as  red- 
orange  golf  balls  (usually  red  and  blue  have  lower 
resolution  than  green  due  to  chroma-sub-sampiing  in  the 
Y.Cr.Cb  technique)?  Could  computer  graphics  create 
new  interactive  games  or  other  locally  interactive  educa¬ 
tion  or  training  in  this  way?  Will  future  display  devices 
be  likely  to  have  the  capability  to  generate  some  amount 
of  screen  area  containing  three  dimensional  computer 
generated  synthetic  images  and  composite  them  cor¬ 
rectly  into  the  two-dimensional  high  dcfiniiion  back¬ 
ground  image  situation? 

7.5.10  Transpon  errors  are  highly  dependant  on  the 
tnnspon  channel.  What  sort  of  error  protection/correc¬ 
tion  should  accompany  advanced  television  digitally 
compressed  data,  in  light  of  such  data's  extreme  sensitiv¬ 
ity  to  errors?  Are  the  mechanisms  being  developed  in  the 
transpon  header  ponion  of  the  universal  headcr/dcscrip- 
lor  sufficient,  or  should  all  digitally  compressed  picture 
data  contain  inherent  protection  and  correction  proto¬ 
cols? 

7.5. 1 1  How  should  encryption  be  supponed?  Is  public- 
key  encryption  appropriate  and  sufficient?  What  levels 
of  encryption  are  required  for  various  transpon  media 
and  various  uses? 

7.5.13  Packct-rctry  type  networks,  such  as  the  currant 
btemet.  or  Ethernets  with  TCP/IP  cannot  gutirantcc  de¬ 
livery  of  data  for  a  real-time  su’cnm  since  a  pxkct  of  tbia 
may  be  “bumped"  and  must  be  resent.  Real-time  streams 
require  that  be  both  int.x't  as  well  as  “un-time." 
invalidating  packct-rcuy  protocols  which  would  provide 
resent  data  subsequent  to  the  required  time.  It  is  therefore 
likely  that  the  basic  ±\a  network  infrasuTJCturc  will  need 
to  significantly  change  in  order  to  .suppon  raal-timc  im¬ 
agery  and  audio  streams  in  shtircd  channels,  unless 
switched  point-to-point  services  arc  used  due  to  iasuffi- 


cient  shared-channel  infrastructure  technology.  In  light 
of  such  significant  changes,  is  it  possible  to  anticipate  the 
future  networking  protocols  and  techniques  so  that  the 
high  definition  image  architecture  can  be  developed  to  be 
compatible?  Are  pxket  prioritization  and  priority-based 
graceful  degradation  likely  to  be  key  techniques  in  such 
future  networks?  If  so.  how  much  priority  information 
and  priority  verification  and  authorization  is  needed? 
Also,  how  many  levels  of  priority,  and  what  priority 
schemes  might  be  required,  to  optimize  quality  for  all 
users  of  a  shared  channel  as  well  as  priority  packet  rout¬ 
ing  performance?  How  much  guaranteed  l^dwidth  is 
required  by  each  type  of  user,  and  can  such  data 
bandwidth  be  guaranteed?  W*!!!  such  potential  guarantee 
requirements  need  some  amount  of  data  bandwidth  reser¬ 
vation?  Is  it  possible  to  design  appropriate  shared  net¬ 
works  with  a  hybrid  of  reservation  and  non-reservation, 
applied  with  both  reservation  and  non-reservation  por¬ 
tions  of  each  connection's  data  stream?  Is  such  a  tech¬ 
nique  a  reasonable  match  between  prioritized 
compressed  advanced  television  and  shared  dau  net¬ 
works.  such  that  a  certain  amount  of  real-time  bandwidth 
is  guaranteed,  but  an  additional  ponion  is  not  reserved 
but  is  usually  provided? 


7.6.1  As  cameras  and  displays  continue  to  increase 
their  luminance  dynamic  range  beyond  the  100:1  which 
is  now  common,  would  it  not  be  ^sirable  to  be  able  to 
suppon  such  extended  range?  Are  10(X):1  luminance  dy¬ 
namic  range  cameras  and  display  devices  likely  in  the 
next  five  to  ten  years? 

7.6.2  Can  a  given  advanced  television  digital  compres¬ 
sion  algorithm  be  augmented  to  allow  more  bits  for  lumi¬ 
nance.  or  would  the  requirement  for  more  accuracy 
defeat  the  ability  of  the  algorithm  to  provide  the  required 
compression  ratio? 

7.6.3  Can  luminance  transfer  functions  such  as  those 
used  in  CCIR  Rcc.  709  and  SMPTE  240M  be  augmented 
to  provide  extended  bbek  and  white  range? 


7.7.1  Do  there  exist  cases  in  which  red  or  blue  details 
on  dark  backgrounds,  or  yellow  or  cyan  details  on  light 
backgrounds,  would  require  higher  resolution  in  color  for 
both  broadcasting  and  other  industrics/applications? 

7.7.2  What  arc  the  benefits  of  an  RGB  digital  represen¬ 
tation  for  interoperability  across  industries  and  applica¬ 
tions  versus  the  Y.Cr.Cb  (also  called  Y.PrPb  and  YUV) 
color  difference  reprascniatiun  commonly  used  in  televi¬ 
sion? 

7.7.3  What  arc  the  tradeoffs  for  using  wider  gamuts. 
iiKluJing  gainuLs  beyond  the  wul  spectrum,  in  covering 
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the  reaJ  colors?  What  indusirics/applicaiions.  such  as 
perhaps  museums,  colleges,  priming,  phoiognphy.  and 
motion  pictures,  may  require  accurate  color  reproduction 
over  a  color  gamut  which  is  wider  than  is  commonly 
proposed  for  high  definition  television  systems.’ 

7.7.4  What  other  color  represenutions.  such  as  HSV. 
CIE  x.y  or  CIE  u’.v*  arc  useful? 

7.7.5  How  much  precision  loss  xcompanics  a  given 
high  definition  color  transformation  to  and  from  these 
other  represenDtions? 

7.7.6  Is  there  benefit  from  using  a  color  space  which 
supports  color  sensors  and  dispbys  which  use  more  than 
three  color  primahes? 

7.7.7  When  are  linear  color  representations  needed  for 
computations?  What  linear  color  rcpresenuiions  are  ap> 
propriate  in  a  device*independent  context? 

7.7.8  What  color  representations  offer  device  indepen¬ 
dence? 

7.7.9  Can  luminance  or  other  brightness  measure  be 
represented  such  that  it  is  orthogonaJ  to  color  representa¬ 
tion?  Can  such  a  representation  offer  color  invariance 
under  exposure  or  illumination  level  adjustment? 

7.7.10  What  color  representation  is  most  useful  for 
adjusting  a  wider  gamut  of  color  for  a  narrower  gamut 
display?  What  are  the  uadeoffs  between  clipping  to  the 
narrower  gamut,  and  a  softer  adjustment,  similar  to  high¬ 
light  compression  in  the  5-curvc? 

7.7.11  What  are  the  perceptual  uniformity  properties  of 
various  useful  color  spaces? 

7.7.12  Are  Hue.  Saturation  and  Value  rcprcscnt.ntions 
useful  in  these  contexts? . 

7.7.13  How  can  digital  numeric  representation  effi¬ 
ciency  be  optimized,  while  still  allowing  the  possibility 
of  wide  gamut  colorimcu7  in  addition  to  efficient  sup¬ 
port  of  narrower  gamuts. 

7.7.14  What  device  independent  color  spaces  are  most 
efficient  for  compression? 

7.7.15  Luminance,  which  matches  human  color  sensi¬ 
tivity.  is  an  appropriate  repruscnuiiion  of  brightness  near 
the  display.  Other  rcpresaitatioiis  may  be  required  in  the 
studio  where  processing  is  required.  For  example,  blue 
screen  compositing  involving  transparency  requires  as 
much  detail  in  blue  as  in  green.  How  siiould  the  division 
be  made  between  the  asc  of  luminance  vasas  an  equal 
representation  of  red.  green  and  blue? 


7.8  Image,  Number  of  Active  Channels 

7.8. 1  As  efficiency  of  compression  improves,  if  ways 
arc  found  to  allow  new  compression  techniques  to  re¬ 
place  old  ones,  then  new  data  bandwidth  will  be  made 
available  with  each  such  upgrade.  This  bandwidth  can 
carry  more  active  image  channels.  Such  channels  can  be 
of  quality  equivalent  to  the  original  if  a  factor  of  two  is 
gained  in  the  upgrade.  This  would  make  possible  the  two 
required  image  streams  for  stereoscopic  images.  Alterna¬ 
tively.  or  additionally,  a  number  of  lower-quality  chan¬ 
nels  could  be  added  with  alternate  views  of  the  scene 
from  different  cameras  (cockpit  camera  in  a  car  race  or 
helmet  camera  in  football,  eloseups.  long  shots,  etc). 
Also,  entirely  different  programming  is  possible  on  the 
new  data.  Can  an  architecture  be  developed  which  will 
allow  this  evolution? 

7.8.2  Is  the  header/descriptor  mechanism  likely  to  be  « 
major  element  in  providing  such  augmented  capabilities, 
or  do  the  issues  extend  into  the  nature  of  i^giadaUe 
compression  algorithms? 

7.9  Audio  Quality 

7.9. 1  Musicam  (used  in  MPEG)  and  other  compressed 
audio  systems  are  also  likely  to  improve.  Similar  issues 
of  upgradabiliiy  and  backward  compatibility  must  be 
considered  when  evaluating  the  system  architecture.  Can 
the  image  architecture  for  this  upgrade  path,  if  one  is 
found,  be  applied  as  effectively  to  audio? 

7.9.2  As  compression  techniques  improve,  a  given  level 
of  quality  can  be  maintained  while  reducing  data 
bandwidth  requirements.  What  are  the  best  uses  for 
newly  freed  bwdwidth?  Will  older  devices  be  able  lo 
decode  new  algorithms  if  they  are  made  to  be  somewhat 
programmable  from  the  beginning?  Programmable  de¬ 
vices  will  allow  algorithmic  improvements  which  can 
utilize  a  given  device.  However,  it  is  likely  that  on- 
provements  in  algorithms  will  have  to  be  accompanied 
by  new  hardware,  thus  making  the  upgrade  path  and 
backward  compatibility  path  difliculL  Are  there  ways  to 
improve  this  situation  in  the  system  architecture? 

7.10  Audio,  Number  of  Channels 

7. 10. 1  Additional  audio  channels  can  be  added  in  ex¬ 
panding  bandwidth  due  to  more  efficient  algorithms  or 
due  to  increased  bandwidth  or  higher  reliability  chan¬ 
nels. 

7. 10.2  Likely  uses  for  additional  channels  are  six-chan¬ 
nel  surround  sound,  and  multiple  languttgcs  on  separate 
tracks. 
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8  Annex 


8.1  Glossary  of  Terms 

Alias -a  tom  of  image  distortion  associated  with  spatial 
and  temporal  Hltering.  A  common  fonn  of  aliasing  is  a 
staifstepped  appearance  along  dbgonal  and  curved  lines. 
See  Scdingiintcrpolation. 

Compression  -  the  process  of  removing  redundancies  in 
a  digital  data  stream  to  reduce  the  amount  of  dau  that 
must  be  stored  or  transmitted.  The  following  terms  are 
often  used  in  describing  image  compression  systems: 

Lossless  Compression  >  techniques  for  data  reduction 
in  which  the  origirtal  information  can  be  recovered 
exactly  as  it  existed  prior  to  encoding. 

Lossy  Compression  -  techniques  for  data  reduction  in 
which  some  information  may  be  lost  in  the  process  of 
encoding  or  decoding  the  data.  In  image  compression, 
an  effon  is  made  to  preserve  as  much  of  the  visually- 
significant  data  as  is  possible,  sacrificing,  when  nec¬ 
essary.  only  dau  unlikely  to  be  perceived  by  the 
average  viewer. 

Scalable  Coding  -  the  ability  to  encode  a  visual  se¬ 
quence  so  os  to  enable  the  decoding  of  the  digital  data 
stream  at  various  spatial  and/or  temporal  resolutions. 
Scalable  compression  techniques  typically  filter  the 
image  into  separate  bands  of  spatial  and/or  temporal 
dau.  Appropriate  dau  reduction  techniques  are  then 
applied  to  each  band  to  match  the  response  character¬ 
istics  of  human  vision. 

Fixed  Data  Rate  Compression  >  techniques  designed 
to  produce  a  data  stream  with  a  constant  data  rate. 
Such  techniques  may  vary  the  quality  of  quantization 
to  match  the  allocated  bandwidth. 

Variable  Data  Rate  Compression  -  techniques  de¬ 
signed  to  produce  a  data  stream  with  a  variable  data 
rate.  Such  techniques  typically  maintain  a  consunt 
level  of  quantization  producing  a  variable  data  rate 
based  on  the  spatial  and  temporal  energy  content  of 
the  images  being  encoded. 

Conditionai  Replenishment  -  a  technique  whereby  vari¬ 
ous  portions  of  the  image  are  updated  at  differing  rates. 
Interframe  coding  techniques  utilize  this  concept  as  they 
eliminate  redundancies  between  frames,  only  storing  or 
transmitting  image  dau  related  to  the  changes  between 
frames.  Conditional  replenishment  occurs  as  image  se¬ 
quences  are  reconstructed  in  dual-ported  fume  memory; 
current  display  designs  typically  scan  the  frame  memory 
at  a  consunt  rate  to  refresh  the  display.  The  following 
terms  are  related  to  the  concept  of  conditiotul  replenish¬ 
ment: 


Addressable  Display  -  a  display  designed  to  allow 
each  pixel  element  to  be  controlled  independently, 
allowing  for  conditional  replenishment. 

Conditional  Acquisition  -  an  image  acquisition  tys- 
icm  that  processes  the  image  dau  and  outputs  infor¬ 
mation  about  changes  as  they  occur,  up  to  the 
maximum  temporal  sampling  rate  of  the  image  sensor. 
Parallel  outputs  from  separate  image  regions  (tiles) 
may  permit  higher  sampling  rates. 

incremental  Update  •  the  practice  of  sending  infor¬ 
mation  only  about  changes  to  an  image  that  occir 
between  temporal  samples;  computer  graphics  wA 
animation  systems  typically  employ  this  technique. 

Multirate  EneotSng  -  the  digital  encoding  of  multiple 
image  components  that  are  presented  to  the  enoote 
at  difTerent  temporal  rates.  For  example,  a  high  ipa- 
tial/Iow  lempor^  resolution  component  may  be  ac¬ 
quired  at  24  fps  while  a  low  spatial/high  temporal 
resolution  component  may  be  acquired  at  72  fps. 

Latency  -  the  deby  (btent  period)  between  the  occurs 
rence  of  an  event  and  its  display  by  an  electrook 
imaging  system.  Each  of  the  following  factors  can 
contribute  to  the  total  latency  of  an  imaging  system;  in 
some  cases  image  distortions  nuy  be  rebt^  to  the 
btency  of  the  system: 

EncodingiDecoding  -  interframe  coding  techniques 
utilized  for  digital  video  compression  requite  the  pro¬ 
cessing  of  multiple  frames  to  eliminate  redundaii^ 
in  the  static  areas  of  the  image  and  to  calculate  motian 
vectors  for  the  motion  components  of  the  anafe. 
Depending  on  the  sophistbation  of  the  coding,  tnier- 
frame  compression  can  introduce  debys  from  two  to 
several  hundred  frames. 

Storage  -  used  here  to  describe  the  access  ^eed  of  a 
digital  image  storage  system.  Fast  access  speeds  re¬ 
duce  the  time  required  to  retrieve  specific  data.  a>- 
ROM  is  said  to  have  high  btency  due  to  iu  slow 
access  speed,  while  hard  disks  have  lower  btency  due 
10  their  faster  access  speeds. 

Synchronisation/processing  <•  the  use  of  Inme  syn¬ 
chronizers.  timebase  correction,  and  digital  proces¬ 
sors  for  visual  effects  introduce  frame  debys  that 
accumubie  as  the  signal  passes  through  a  video  priK 
duction  system. 

Pixel  -  the  smallest  picture  element  of  an  image  (one 
sample  of  each  color  component).  A  digital  dispby  is 
typically  specified  in  terms  of  pixeb  and  color  depth:  the 
number  of  digital  bits  stored  per  pixel.  A  picture  element 
is  also  called  a  pel  in  the  Held  of  image  processing. 
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Quantization  Levels  -  Ihc  prcJcicmunctJ  levels  at  which 
an  analog  signal  con  be  sampled  os  determined  by  the 
resolution  of  the  analog  to  digital  convener  (in  bits  per 
sample)  or  the  number  of  bits  stored  for  the  sampled 
signal.  See;  Sampling. 

Resolution -The  capability  of  an  optical  system,  or  other 
imaging  system,  of  making  clear  and  distinguishable  the 
separate  paru  or  components  of  an  object.  With  respect 
to  the  lebtionship  of  the  human  visua:  system  to  an 
imaging  system  dispby.  several  factors  must  be  taken 
into  consideration: 

Spatial  Resolution  -  the  ability  of  the  display  to  repro* 
duce  adequate  detail  to  allow  the  visual  system  to 
distinguish  the  separate  pans  or  components  of  an 
object. 

Temporal  Resolution  -  the  ability  of  the  display  to 
reproduce  adequate  detail  to  allow  the  visual  system 
to  distinguish  the  separate  pons  or  components  of  an 
object  that  is  moving  through  the  dispby. 

Perceived  Resolution  -  from  the  observer’s  view¬ 
point.  the  apparent  resolution  of  a  display.  This  con¬ 
cept  is  based  on  the  ability  of  the  viewer  to  resolve  all 
image  detail  presented  by  the  dispby.  At  the  ideal 
viewing  distance,  perceived  and  actual  spatbl  resolu¬ 
tion  are  equal;  at  greater  viewing  distances  the  per¬ 
ceived  resolution  is  higher  than  the  actual  spatbl 
resolution  of  the  display. 

Sampling  -  the  first  step  in  the  process  of  convening  an 
analog  signal  into  a  digital  representation.  This  is  accom¬ 
plished  by  measuring  the  v^ue  of  the  analog  signal  at 
regular  intervals  called  samples.  These  values  are  then 
encoded  to  provide  a  digiul  representation  of  the  analog 
signal.  Image  samples  are  usually  called  pixels.  See: 
Quontiaation  Levels. 

Sealing  (spatial)  -  alteration  of  the  spatbl  resolution  of 
an  acquired  image  to  decrease  or  increase  the  number  of 
pixels  used  to  represent  the  image.  Any  of  the  following 
techniques  may  be  used  for  image  scaling,  resulting  in 
the  addition  of  image  artifacts  as  Indicated: 

Interpolation  -  the  process  of  averaging  pixel  infor¬ 
mation  when  scaling  an  image.  When  reducing  the 
sae  of  an  image,  pixels  arc  averaged  to  create  a  single 
new  pixel  that  replaces  two  or  more  adjacent  pixels; 
when  an  image  is  scaled  up  in  size,  additional  pixels 
are  created  by  averaging  the  values  of  adjacent  pixels. 
Interpolation  generally  cau.ses  an  apparent  softening 
of  the  image  when  it  is  increased  in  size,  because  the 
averaging  process  docs  not  create  any  new  informa¬ 
tion. 

Pixel  Replication  -  a  process  used  to  dispby  an  image 
at  a  larger  size  by  repeating  pixels  along  a  horizontal 
line  and/or  repeating  lines  to  iiKrcasc  the  vertical  size. 
For  example  a  320  by  240  pixel  image  can  be  dis¬ 


played  at  640  by  480  size  by  duplicating  each  pixel 
along  a  line  and  then  repeating  the  line:  the  resulting 
image  will  contain  blocks  of  four  pixels  with  the  same 
value. 

RvsatnpUng  -  the  process  of  converting  images  be¬ 
tween  the  spatbl  resolutions  utilized  by  different  im¬ 
aging  systems.  The  process  may  include  interpobtion 
to  correct  for  differences  in  pixel  geometry  or  to  scale 
the  image.  Resampling  is  frequently  used  to  change 
the  density  of  pixels,  typically  measured  in  dots  per 
inch  (DPI),  when  preparing  images  for  printing  using 
halftones  and  color  separations. 

Sub'SampUng  -  bandwidth  reduction  techniques  that 
reduce  the  amount  of  digital  data  used  to  represent  on 
image.  "The  following  techniques  are  commonly  uti¬ 
lized 

Chroma  Sub-Sanytling  —  the  reduction  of  color 
resolution  by  reducing  the  bandwidth  of  color 
difference  signals  as  practiced  in  composite  video 
uansmission  and  recording  systems  or  by  elimi¬ 
nating  some  color  difference  pixel  information  in 
digiul  processing  systems. 

Decimation  (pixel  sub  sampling)  -  the  process  of 
discarding  complete  samples.  The  resulting  image 
is  reduced  in  size  and  may  suffer  from  aliasing. 

Scaling  (rempyra/)-  alteration  of  the  temporal  resolution 
of  a  visual  sequence,  to  decrease  (typical  case),  or  in¬ 
crease  the  amount  of  dau  used  to  represent  the  visual 
sequence.  This  process  may  include  one  or  more  of  the 
following  techniques: 

Interpolation  -  the  process  of  adding  or  deleting  tem¬ 
poral  samples  to  a  visual  sequence  by  averaging  adja¬ 
cent  temporal  samples.  Results  are  poor  when  there  is 
mpid  motion. 

Motion  compensation  -  the  process  of  adding  or  de¬ 
leting  temporal  samples  to  a  visual  sequence  by  pre¬ 
dicting  where  a  moving  object  should  appear  in  the 
resulting  new  temporal  sample. 

Replication  -  the  repetition  of  temporal  dau  to  in¬ 
crease  the  temporal  dis^tby  rate  of  a  visual  sequence. 
Typically  used  in  computer  systems  to  provide  a 
higher  screen  refresh  rate  than  the  tcmporel  rate  of  the 
visual  sequence  being  dispbyed.  Also  used  in  film  to 
video  conversions  to  change  from  the  24  fps  temporal 
mte  of  film  to  the  30  fps  temporal  rate  of  NTSC 
composite  video  (3-2  pull  down). 

Speed  Cluinge  -  pbying  information  acquired  at  one 
temporal  rate  at  a  different  temporal  rate.  For  exam¬ 
ple.  24  fps  film  is  pkiyed  at  2.5  fps  when  it  is  convened 
to  Pal  composite  video  (a  4%  siteed  change). 
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Sub-sampling  —  dropping  entire  temporal  samples 
(e.g..  video  frames)  to  reduce  the  data  rate. 

8.2  Temporal  Rate  Analysis 

This  section  includes  detailed  descriptions  of  some  tech* 
niques  for  temporal  rate  conversions  that  might  be  uti¬ 
lized  in  the  process  of  translating  existing  film  or 
television  pictures  for  dispby  at  higher  refresh  rates. 

8.2.1  59.94  and  60.0  Hz  versus  72  and  75  Hz 

Computers  which  use  CRT  displays  (as  opposed  to  Hat 
panel  displays)  currently  comprise  nearly  ail  of  the  mar¬ 
ket  except  for  poruble  laptop  computers.  Substantial  ex¬ 
perience  with  high  resolution  image  presenution.  with 
wide  Held  of  view  and  high  brightness,  has  increasingly 
led  to  higher  refresh  rates  such  as  72  and  73  Hz. 

The  use  of  72  Hz  is  very  naturally  compatible  with  24  fps 
Aim.  being  exactly  three  times.  The  rate  of  73  Hz  is 
compatible  with  the  common  European  practice  of  uans- 
ferring  moUon  picture  film  to  30  Hz  video  by  nmning  it 
at  25  ^s  or  4%  fast.  A  display  which  has  a  sync  tolerance 
range  of  4%  could  adapt  to  both  72  Hz  and  73  Hz  picture 
rates.  A  4%  tolerance  docs  not  add  undue  cost  to  a  dis¬ 
play.  A  CRT  or  other  display  which  can  present  both  72 
and  73  Hz  progressively  scanned  images  can  be  a  key 
architectural  element  in  a  digital  image  architecture. 
Such  a  display  can  be  used  on  computers  as  well  as  being 
able  to  be  used  for  high  quality  presenution  of  mouon 
picture  film,  and  for  30  Hz  material. 

8.2.2  Motion  Prediction 

Very  expensive  receivers  can  attempt  to  perform  motion- 
predictive  firame  rate  conversion  in  the  receiver.  The 
motion  vectors  which  are  used  in  the  motion  compensa¬ 
tion  ponion  of  the  digital  image  compression  process 
could  be  helpful  in  this  rcgttrd.  Compression  only  re¬ 
quires  statistically  beneficial  correlation  between  the 
motion  vectors  and  image  motion,  while  rate  conversion 
requires  an  accurate  motion  analysis  in  order  to  be  arti¬ 
fact  free. 

8.2.3  Temporal  Undersampling 

It  is  common  practice  to  temporally  undersample.  This 
leads  to  reverse  wagon  wheels  and  many  other  well 
known  temporal  aliasing  artifacts.  In  motion  picture  film 
production,  temporal  undersampling  ukes  the  form  of  a 
near  30%  duty  cycle  camera.  Shutter  angles  near  180 
degrees  are  typical  for  must  production,  where  approxi¬ 
mately  half  the  time  is  u.scd  to  expose  each  frame,  with 
the  other  half  being  used  to  pull  the  film  down  to  the  next 
frame. 

In  video,  the  temporal  light  sampling  time  for  each  pixel 
is  sometimes  adjusuble.  Video  temporal  light-capturing 


duty  cycles  will  vary  from  about  30%  to  about  100%. 
Short  duty  cycles  or  adjustable  duty  cycles  are  only 
available  on  some  cameras,  since  light  sensitivity  is  usu¬ 
ally  reduced.  However,  even  some  home  video  cameras 
have  a  persistence  type  control  to  allow  image  capture 
duty  cycle  adjustment  between  sharp  frames  and  smooth 
motion. 

Some  experts  feel  that  a  30%  exposure  duty  cycle  for 
each  frame  is  the  most  common  choice  since  it  provides 
a  balance  between  image  sharpness  and  smooth  motion. 
It  is  likely  that  some  photography  of  moving  objects  may 
desire  a  short  shutter  time  in  order  to  emphasize  sharp¬ 
ness.  and  to  distinguish  each  image  on  each  frame  sepa¬ 
rately.  Other  uses  such  as  '‘go  motion,**  used  successfully 
in  many  Lucasfilm  productions,  may  favor  smooth  mty 
lion  by  choosing  a  100%  duty  cycle.  The  100%  duty 
cycle  is  achieved  in  this  example  by  a  repeat  motion 
non-real-time  computer  controlled  camera. 

In  computer  graphics,  when  motion  blur  is  simubted,  • 
duty  cycle  is  usually  specified.  Some  standard  software 
rendering  packages,  such  as  Pixar's  Renderman*,  offer  a 
value  from  0  to  1  which  controls  the  image  duty  cycle  for 
the  motion  blur  processing. 

The  common  use  of  temporal  undersampling  virtually 
ensures  some  degree  of  aliasing  on  images  which  have 
periodic  motion  or  object  interaction  near  the  frame  rate 
or  its  harmonics.  There  are  also  some  pathological  cases 
where  image  misrepresenution  due  to  temporal  aliasing 
can  make  motion  vector  analysis  impossible.  When  such 
motion  vectors  are  required  for  de-interlacing  or  other 
standards  conversion  involving  temporal  transcoding, 
such  conversions  become  artifact  prone  due  to  inconect 
motion  analysis.  Temporal  aliasing  is  also  exaggerated 
when  temporal  transc^ing  is  employed  without  the  use 
of  motion  vectors. 

It  is  very  likely  that  temporal  aliasing  artifacts  will  ap¬ 
pear  at  conversion  rate  bnt  frequencies.  Thus,  it  is  nec- 
ess.vy  to  use  beat  frequencies  which  are  at  rates  as  high 
as  possible. 

8.2.4  Summary  of  Temporal  Rate  Aitalysis 

In  a  digiul  image  architecture  which  is  interoperabte  and 
scalable  across  industries  and  applications,  it  is  necessary 
to  accommodate  dispbys  which  exceed  70  Hz  in  refresh 
rate. 

A  choice  of  either  72  or  73  Hz  for  the  dispby  tefresh  rate 
could  be  bertcficial  in  both  advanced  television  systems 
and  computer  uses,  panic uiariy  since  NTSC  may,  in  the 
future,  become  obsolete. 

It  is  likely  that  fiat  panel  displays  such  as  actiNO  matrix 
liquid  crystal  color  di.spbys  may  begin  to  take  hold  in  the 
market.  These  devices  do  not  tuve  characteristic  flicker. 
Thus,  it  is  possible  to  update  the  image  at  low  rates,  such 
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as  24  fps  when  presenting  film  im:igcs  or  25  or  50  frames 
fps  when  presenting  television  images,  without  resulting 
nicker. 

It  is  possible  that  future  display  devices  may  be  devcl* 
oped  which  maintain  the  display  on  an  effectively  indef* 
inite  pixel*by-pixel  basis,  ‘ntese  devices  will  be  frame 
rate  independent  and  will  .permit  even  greater  efficiency 
in  the  distribution  of  visual  communications. 
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STATEMENT  CONCERNING  USER  REQUIREMENTS 

FOR 

SOURCE  CODING  AND  MULTIPLEXING 
FOR  BROADCAST  APPLICATIONS 


Introduction 

It  is  widely  recognized  today  that  broadcasting  is  in 
the  middle  of  a  revolution  leading  towards  the  "all- 
digital"  broadcast  system  of  the  future. 

In  addition  to  the  established  trend  to  all  digital 
operation  in  television  production  and  distribution, 
the  development  of  systems  for  delivering  digital 
television  to  the  home  via  terrestrial,  cable  and 
satellite  digital  networks  is  progressing  rapidly  on  a 
world-wide  basis. 

CCIR  Activities 

In  recognition  of  this  dominant  trend  in  broadcast 
development  and  of  the  importance  and  relevance  of  the 
work  of  ISO/IEC/MPEG,  the  recent  meetings  (May  1992)  or 
CCIR  Study  Groups  10,  11  and  CMTT  appointed  a  Special 
Rapporteur  Group  to  draw  up  Applications  Requirements 
for  digital  source  coding  and  multiplexing  and  to 
prepare  a  Report  on  them  for  transmission  to  the 
appropriate  groups  in  MPEG  (see  doc.  11/135  attached). 

Considerable  progress  towards  the  definition  of  the 
requirements  for  certain  broadcasting  applications  has 
already  been  made  in  the  CCIR.  See  docs.  ll-AHG-DC/16, 
ll-AHG-DC/17,  ll-AHG-DC/29. 

Of  particular  importance  and  the  highest  priority  in 
the  work  of  Study  Group  11,  at  the  present  time,  is  the 
definition  of  system (s)  for  the  broadcasting  of  digital 
television  in  narrowband  (6,7,8  MHz)  terrestrial 
channels.  This  work  is  proceeding  in  Task  Group  11/3. 
In  addition,  accelerated  studies  are  taking  place  to 
define  system(s),  having  a  maximum  of  commonality  with 
these  terrestrial  system (s),  for  the  broadcasting  of 
digital  television  in  cable  and  telecommunications 
channels  and  in  broadcasting  satellite  channels.  This 
work  is  proceeding  in  Task  Group  CMTT/ 2  and  JWP-10-11/5 
respectively. 


r 

I  Nhile  it  is  as  yet  too  early  to  be  precise  about  the 

i  detailed  requirements  for  these  services,  it  is 

appropriate  at  this  time  to  draw  attention  to  the  great 
interest  in  systems  able  to  support  hierarchical  source 
and  channel  coding  to  provide  HDTV,  EDTV  and  CDTV  (STV) 
transmission  and  receiver  options  for  a  broadcast 
^  signal  with  transmission  bit  rates  in  the  range  20-30 

I  Mbit/s. 

System  Exeunple 

As  an  example  of  the  sort  of  system  tinder  discussion, 
(purely  for  illustration) ,  a  hierarchical  (scalable) 
Digital  Terrestrial  Television  Broadcast  (DTTB)  system 
might  provide  an  HDTV  quality  at  24  Mbit/s,  with  a 
nested  EDTV  quality  of  12  Mbit/s  and  a  core  CDTV 
quality  at  4  Mbit/s.  A  common  transport  structure 
would  be  used  for  all  delivery  systems. 

Timescale 

In  recognition  of  the  rapid  time-scale  envisaged  for 
standardisation  within  MPEG,  the  Special  Rapporteur 
Group  is  charged  with  the  production  of  its  first 
Report  concerning  the  requirements  for  broadcasting  and 
secondary  distribution  in  September  1992.  Further 
Reports  will  be  produced  as  studies  progress,  on  this 
and  other  areas  of  interest,  such  as  recording. 

The  schedule  of  meetings  envisaged  for  CCIR  Study  Group 
■  11,  relevant  to  this  work,  is  as  follows: 


-Task 

Group 

11/4 

,  Haromisation 

Oct 

13-15, 

1992 

-Task  Group 

11/1 

HDTV 

Production 

Dec 

14-18, 

1992 

-Task 

Group 

11/2 

HDTV 

Interfaces 

Dec 

14-18, 

1992 

-Task 

Group 

11/3 

DTTB 

Dec 

14-18, 

1992 

The  participation  of  an  MPEG  liaison  representative  in 
these  meetings  in  invited. 


It  is  envisaged  that  a  more  comprehensive  second  Report 
will  be  submitted  to  MPEG  early  in  1993. 
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APPENDIX  G 
TASK  GROUP  11/4 

PROPOSAL  FOR  A  DRAFT  NEW  RECOMMENDATION 

HARMONISATION  OF  DIGITAL  METHODS 
FOR  DELIVERY  SYSTEMS  FOR 
TELEVISION  SERVICES  TO  THE  HOME 
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